Special Relativity 


Wikibooks.org 



March 12, 2013 


On the 28th of April 2012 the contents of the English as well as German Wikibooks and Wikipedia 
projects were licensed under Creative Commons Attribution-ShareAlike 3.0 Unported license. An 
URI to this license is given in the list of figures on page 155. If this document is a derived work 
from the contents of one of these projects and the content was still licensed by the project under 
this license at the time of derivation this document has to be licensed under the same, a similar or a 
compatible license, as stated in section 4b of the license. The list of contributors is included in chapter 
Contributors on page 151. The licenses GPL, LGPL and GFDL are included in chapter Licenses on 
page 159, since this book and/or parts of it may or may not be licensed under one or more of these 
licenses, and thus require inclusion of these licenses. The licenses of the figures are given in the list of 
figures on page 155. This PDF was generated by the IATj^X typesetting software. The IATj^X source 
code is included as an attachment (source.7z.txt) in this PDF hie. To extract the source from the 
PDF hie, we recommend the use of http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/ 
utility or clicking the paper clip attachment symbol on the lower left of your PDF Viewer, selecting 
Save Attachment. After extracting it from the PDF hie you have to rename it to source. 7z. To 
uncompress the resulting archive we recommend the use of http://www.7-zip.org/. The DTj^K 
source itself was generated by a program written by Dirk Hiinniger, which is freely available under 
an open source license from http://de.wikibooks.Org/wiki/Benutzer:Dirk_Huenniger/wb2pdf. 
This distribution also contains a configured version of the pdf latex compiler with all necessary 
packages and fonts needed to compile the DTj^X source included in this PDF hie. 



Contents 


1 Introduction 3 

1.1 Introduction 3 

1.2 Historical Development 4 

1.3 Intended Audience 11 

1.4 What's so special? 11 

1.5 Common Pitfalls in Relativity 11 

1.6 A Word about Wiki 12 

2 The principle of relativity 13 

2.1 The principle of relativity 13 

2.2 Inertial reference frames 21 

3 Spacetime 25 

3.1 The modern approach to relativity 25 

3.2 The lightcone 36 

3.3 The Lorentz transformation equations 38 

3.4 More about the relativity of simultaneity 39 

3.5 The nature of length contraction 44 

3.6 More about time dilation 46 

3.7 The twin paradox 47 

3.8 The Pole-barn paradox 54 

3.9 The transverse doppler effect 63 

3.10 Relativistic transformation of angles 63 

3.11 Addition of velocities 64 

3.12 Introduction 65 

3.13 Momentum 66 

3.14 Force 70 

3.15 Energy 71 

3.16 Nuclear Energy 75 

4 aether 77 

4.1 Introduction 77 

4.2 The aether drag hypothesis 78 

4.3 The Michelson-Morley experiment 83 

4.4 Mathematical analysis of the Michelson Morley Experiment 91 

4.5 Lorentz-Fitzgerald Contraction Hypothesis 95 

4.6 External links 96 

5 mathematical approach 97 

5.1 Vectors 97 


III 



Contents 


5.2 Matrices 103 

5.3 Indicia! Notation 108 

5.4 Analysis of curved surfaces and transformations Ill 

6 Waves 115 

6.1 Applications of Special Relativity 115 

6.2 External Links 121 

7 Mathematical Transformations 123 

7.1 The Lorentz transformation 123 

7.2 Hyperbolic geometry 138 

7.3 Mathematics of the Lorentz Transformation Equations 145 

8 Contributors 151 

List of Figures 155 

9 Licenses 159 

9.1 GNU GENERAL PUBLIC LICENSE 159 

9.2 GNU Free Documentation License 160 

9.3 GNU Lesser General Public License 161 


1 




1 Introduction 


1.1 Introduction 

The Special Theory of Relativity was the result of developments in physics at the end of the 
nineteenth century and the beginning of the twentieth century. It changed our understanding 
of older physical theories such as Newtonian Physics and led to early Quantum Theory and 
General Relativity. 

Special Relativity does not just apply to fast moving objects, it affects the everyday world 
directly through "relativistic" effects such as magnetism and the relativistic inertia that 
underlies kinetic energy and hence the whole of dynamics. 

Special Relativity is now one of the foundation blocks of physics. It is in no sense a provisional 
theory and is largely compatible with quantum theory; it not only led to the idea of matter 
waves but is the origin of quantum 'spin' and underlies the existence of the antiparticles. 
Special Relativity is a theory of exceptional elegance, Einstein crafted the theory from simple 
postulates about the constancy of physical laws and of the speed of light and his work has 
been refined further so that the laws of physics themselves and even the constancy of the 
speed of light are now understood in terms of the most basic symmetries in space and time. 

Further Reading 

Feynman Lectures on Physics. Symmetry in Physical Laws. (World Student) Vol 1. Ch 52. 

Gross, D.J. The role of symmetry in fundamental physics. PNAS December 10, 1996 vol. 93 
no. 25 14256-14259 http://www.pnas.org/content/93/25/14256.full 
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Figure 1 James Clerk Maxwell 

In the nineteenth century it was widely believed that light was propagated in a medium 
called the "aether". In 1865 James Clerk Maxwell produced a theory of electromagnetic 
waves that seemed to be based on this aether concept. According to his theory the velocity of 
electromagnetic waves such as light would depend on two constant factors, the permittivity 
and permeability constants, which were properties of the aether. Anyone who was stationary 
within the aether would measure the speed of light to be constant as a result of these 
constant properties of the aether. A light ray going from one stationary point to another in 
the aether would take the same amount of time to make the journey no matter who observed 
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it and observers would measure the velocity of any light that reached them as the sum of 
their velocity relative to the aether and the velocity of light in the aether. 

If space were indeed full of an aether then objects would move through this aether and this 
motion should be detectable. Maxwell proposed that the state of motion of an observer 
relative to an aether might be tested experimentally by reflecting beams of light at right 
angles to each other in an interferometer. As the interferometer was moved through the 
aether the addition of the velocity of the equipment to the velocity of the light in the aether 
would cause a distinctive interference pattern. Maxwell's idea was submitted as a letter to 
Nature in 1879 (posthumously). 

Albert Michelson read Maxwell's paper and in 1887 Michelson and Morley performed an 
'interferometer' experiment to test whether the observed velocity of light is indeed the sum 
of the speed of light in the aether and the velocity of the observer. To everyone's surprise the 
experiment showed that the speed of light was independent of the speed of the destination 
or source of the light in the proposed aether. 
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Figure 2 Albert Abraham Michelson 


How might this "null result" of the interferometer experiment be explained? How could the 
speed of light in a vacuum be constant for all observers no matter how they are moving 
themselves? It was possible that Maxwell's theory was correct but the theory about the way 
that velocities add together (known as Galilean Relativity) was wrong. Alternatively it was 
possible that Maxwell's theory was wrong and Galilean Relativity was correct. However, 
the most popular interpretation at the time was that both Maxwell and Galileo were correct 
and something was happening to the measuring equipment. Perhaps the instrument was 
being squeezed in some way by the aether or some other material effect was occurring. 
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Various physicists attempted to explain the Michelson and Morley experiment. George 
Fitzgerald (1889) and Hendrik Lorentz (1895) suggested that objects tend to contract along 
the direction of motion relative to the aether and Joseph Larmor (1897) and Hendrik Lorentz 
(1899) proposed that moving objects are contracted and that moving clocks run slow as 
a result of motion in the aether. Fitzgerald, Larmor and Lorentz 's contributions to the 
analysis of light propagation are of huge importance because they produced the "Lorentz 
Transformation Equations". The Lorentz Transformation Equations were developed to 
describe how physical effects would need to change the length of the interferometer arms and 
the rate of clocks to account for the lack of change in interference fringes in the interferometer 
experiment. It took the rebellious streak in Einstein to realise that the equations could also 
be applied to changes in space and time itself. 



Figure 3 Albert Einstein 
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By the late nineteenth century it was becoming clear that aether theories of light propagation 
were problematic. Any aether would have properties such as being massless, incompressible, 
entirely transparent, continuous, devoid of viscosity and nearly infinitely rigid. In 1905 
Albert Einstein realised that Maxwell's equations did not require an aether. On the basis of 
Maxwell's equations he showed that the Lorentz Transformation was sufficient to explain that 
length contraction occurs and clocks appear to go slow provided that the old Galilean concept 
of how velocities add together was abandoned. Einstein's remarkable achievement was to be 
the first physicist to propose that Galilean relativity might only be an approximation to 
reality. He came to this conclusion by being guided by the Lorentz Transformation Equations 
themselves and noticing that these equations only contain relationships between space and 
time without any references to the properties of an aether. 

In 1905 Einstein was on the edge of the idea that made relativity special. It remained for 
the mathematician Hermann Minkowski to provide the full explanation of why an aether 
was entirely superfluous. He announced the modern form of Special Relativity theory in 
an address delivered at the 80th Assembly of German Natural Scientists and Physicians on 
September 21, 1908. The consequences of the new theory were radical, as Minkowski put it: 

"The views of space and time which I wish to lay before you have sprung from the 
soil of experimental physics, and therein lies their strength. They are radical. 
Henceforth space by itself, and time by itself, are doomed to fade away into 
mere shadows, and only a kind of union of the two will preserve an independent 
reality. " 

What Minkowski had spotted was that Einstein's theory was actually related to the theories 
in differential geometry that had been developed by mathematicians during the nineteenth 
century. Initially Minkowski's discovery was unpopular with many physicists including 
Poincare, Lorentz and even Einstein. Physicists had become used to a thoroughly materialist 
approach to nature in which lumps of matter were thought to bounce off each other and 
the only events of any importance were those occurring at some universal, instantaneous, 
present moment. The possibility that the geometry of the world might include time as well 
as space was an alien idea. The possibility that phenomena such as length contraction could 
be due to the physical effects of spacetime geometry rather than the increase or decrease of 
forces between objects was as unexpected for physicists in 1908 as it is for the modern high 
school student. Einstein rapidly assimilated these new ideas and went on to develop General 
Relativity as a theory based on differential geometry but many of the earlier generation of 
physicists were unable to accept the new way of looking at the world. 

The adoption of differential geometry as one of the foundations of relativity theory has 
been traced by Walter (1999). Walter's study shows that by the 1920's modern differential 
geometry had become the principal theoretical approach to relativity, replacing Einstein's 
original electrodynamic approach. 



Historical Development 



Figure 4 Henri Poincare 


It has become popular to credit Henri Poincare with the discovery of the theory of Special 
Relativity, but Poincare got many of the right answers for some of the wrong reasons. He 
even came up with a version of E = me 2 . In 1904 Poincare had gone as far as to enunciate 
the "principle of relativity" in which "The laws of physical phenomena must be the same, 
whether for a fixed observer, as also for one dragged in a motion of uniform translation, so 
that we do not and cannot have any means to discern whether or not we are dragged in a 
such motion." Furthermore, in 1905 Poincare coined the term "Lorentz Transformation" for 
the equation that explained the null result of the Michelson Morley experiment. Although 
Poincare derived equations to explain the null result of the Michelson Morley experiment, 
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his assumptions were still based upon an aether. It remained for Einstein to show that an 
aether was unnecessary. 

It is also popular to claim that Special Relativity and aether theories such as those due 
to Poincare and Lorentz are equivalent and only separated by Occam's Razor. This is 
not strictly true. Occam's Razor is used to separate a complex theory from a simple 
theory, the two theories being different. In the case of Poincare's and Lorentz's aether 
theories both contain the Lorentz Transformation which is already sufficient to explain 
the Michelson and Morley Experiment, length contraction, time dilation etc. 
without an aether. The aether theorists simply failed to notice that this is a possibility 
because they rejected spacetime as a concept for reasons of philosophy or prejudice. In 
Poincare's case he rejected spacetime because of philosophical objections to the idea of 
spatial or temporal extension (see note 1). 

It is curious that Einstein actually returned to thinking based on an aether for philosophical 
reasons similar to those that haunted Poincare (See Granek 2001). The geometrical form 
of Special Relativity as formalised by Minkowski does not forbid action at a distance and 
this was considered to be dubious philosophically. This led Einstein, in 1920, to reintroduce 
some of Poincare's ideas into the theory of General Relativity. Whether an aether of the 
type proposed by Einstein is truly required for physical theory is still an active question in 
physics. However, such an aether leaves the spacetime of Special Relativity almost intact 
and is a complex merger of the material and geometrical that would be unrecognised by 
19th century theorists. 

• Einstein, A. (1905). Zur Elektrodynamik bewegter Kdrper, in Annalen der Physik. 
17:891-921. http : //www . f ourmilab . ch/etexts/einstein/specrel/www/ 

• Granek, G (2001). Einstein's ether: why did Einstein come back to the ether? Ape- 

iron, Vol 8, 3. http://citeseer.ist.psu.edu/cache/papers/cs/32948/http: 

zSzzSzredshift . vif . comzSzJournalFileszSzV08N03PDFzSzV08N3GRF . PDF/ 
granekOleinsteins . pdf 

• S. Walter. The non-Euclidean style of Minkowskian relativity. Published in J. Gray 
(ed.), The Symbolic Universe, Oxford University Press, 1999, 91-127. http: //www. 
univ-nancy2 . f r/DepPhilo/walter/papers/nes . pdf 

• G. F. FitzGerald (1889), The Ether and the Earth’s Atmosphere, Science 13, 390. 

• Larmor, J. (1897), On a Dynamical Theory of the Electric and Luminiferous Medium, 
Part 3, Relations with material media, Phil. Trans. Roy. Soc. 190: 205-300, 
doi: 10. 1098/rsta. 1897.0020 

• H. A. L. Lorentz (1895), Versuch einer Theorie der electrischen und optischen Erschein- 
ungen in bewegten Korpern, Brill, Leyden. 

• S. Walter. (1999), The non-Euclidean style of Minkowskian relativity. Published in 
J. Gray (ed.), The Symbolic Universe, Oxford University Press, 1999, 91-127. http: 
//www . univ-nancy2 . f r/DepPhilo/walter/papers/nes . pdf 

Note 1: The modern philosophical objection to the spacetime of Special Relativity is that it 
acts on bodies without being acted upon, however, in General Relativity spacetime is acted 
upon by its content. 


10 



Intended Audience 


1.3 Intended Audience 

This book presents special relativity (SR) from first principles and logically arrives at the 
conclusions. There will be simple diagrams and some thought experiments. Although the 
final form of the theory came to use Minkowski spaces and metric tensors, it is possible to 
discuss SR using nothing more than high school algebra. That is the method used here in 
the first half of the book. That being said, the subject is open to a wide range of readers. 
All that is really required is a genuine interest. 

For a more mathematically sophisticated treatment of the subject, please refer to the 
Advanced Text in Wikibooks. 

The book is designed to confront the way students fail to understand the relativity of 
simultaneity. This problem is well documented and described in depth in: Student 

understanding of time in special relativity: simultaneity and reference frames 1 by Scherr et 
al. 

1.4 What's so special? 

The special theory was suggested in 1905 in Einstein's article "On the Electrodynamics of 
Moving Bodies", and is so called because it applies in the absence of non-uniform gravitational 
fields. 

In search of a more complete theory, Einstein developed the general theory of relativity 
published in 1915. General relativity (GR), a more mathematically demanding subject, 
describes physics in the presence of gravitational fields. 

The conceptual difference between the two is the model of spacetime used. Special relativity 
makes use of a Euclidean-like (flat) spacetime. GR lives in a spacetime that is generally not 
flat but curved, and it is this curvature which represents gravity. The domain of applicability 
for SR is not so limited, however. Spacetime can often be approximated as flat, and there 
are techniques to deal with accelerating special relativistic objects. 


1.5 Common Pitfalls in Relativity 

Here is a collection of common misunderstandings and misconceptions about SR. If you 
are unfamiliar with SR then you can safely skip this section and come back to it later. If 
you are an instructor, perhaps this can help you divert some problems before they start by 
bringing up these points during your presentation when appropriate. 

Beginners often believe that special relativity is only about objects that are moving at high 
velocities. Strictly speaking, this is a mistake. Special relativity applies at all velocities 
but at low velocity the predictions of special relativity are almost identical to those of the 
Newtonian empirical formulae. As an object increases its velocity the predictions of relativity 
gradually diverge from Newtonian Mechanics. 


1 http : / / arxiv. org/ftp/physic s/paper s/0207/0207 109 .pdf 
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There is sometimes a problem differentiating between the two different concepts "relativity of 
simultaneity" and "signal latency/delay." This book text differs from some other presentations 
because it deals with the geometry of spacetime directly and avoids the treatment of delays 
due to light propagation. This approach is taken because students would not be taught 
Euclid's geometry using continuous references to the equipment and methods used to measure 
lengths and angles. Continuous reference to the measurement process obscures the underlying 
geometrical theory whether the geometry is three dimensional or four dimensional. 

If students do not grasp that, from the outset, modern Special Relativity proposes that the 
universe is four dimensional, then, like Poincare, they will consider that the constancy of 
the speed of light is just an event awaiting a mechanical explanation and waste their time 
by pondering the sorts of mechanical or electrical effects that could adjust the velocity of 
light to be compatible with observation. 


1.6 A Word about Wiki 

This is a wikibook. That means it has great potential for improvement and enhancement. 
The improvement can be in the form of refined language, clear mathematics, simple diagrams, 
and better practice problems and answers. The enhancement can be in the form of artwork, 
historical context of SR, anything. Feel free to improve and enhance Special Relativity and 
other wikibooks as you see necessary. 
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2.1 The principle of relativity 



Figure 5 Galileo Galilei 
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Principles of relativity address the relationship between observations made at different places. 
This problem has been a difficult theoretical challenge since the earliest times and involves 
physical questions such as how the velocities of objects can be combined and how influences 
are transmitted between moving objects. 

One of the most fruitful approaches to this problem was the investigation of how observations 
are affected by the velocity of the observer. This problem had been tackled by classical 
philosophers but it was the work of Galileo that produced a real breakthrough. Galileo 1 
(1632), in his "Dialogue Concerning the Two Chief World Systems", considered observations 
of motion made by people inside a ship who could not see the outside: 

"have the ship proceed with any speed you like, so long as the motion is uniform and not 
fluctuating this way and that. You will discover not the least change in all the effects 
named, nor could you tell from any of them whether the ship was moving or standing still. 


According to Galileo, if the ship moved smoothly then someone inside it would be unable to 
determine whether they were moving. If people in Galileo's moving ship were eating dinner 
they would see their peas fall from their fork straight down to their plate in the same way 
as they might if they were at home on dry land. The peas move along with the people and 
do not appear to the diners to fall diagonally. This means that the peas continue in a state 
of uniform motion unless someone intercepts them or otherwise acts on them. It also means 
that simple experiments that the people on the ship might perform would give the same 
results on the ship or at home. This concept led to “Galilean Relativity” in which it was 
held that things continue in a state of motion unless acted upon and that the laws of physics 
are independent of the velocity of the laboratory. 

This simple idea challenged the previous ideas of Aristotle. Aristotle 2 had argued in his 
Physics that objects must either be moved or be at rest. According to Aristotle, on the basis 
of complex and interesting arguments about the possibility of a 'void', objects cannot remain 
in a state of motion without something moving them. As a result Aristotle proposed that 
objects would stop entirely in empty space. If Aristotle were right the peas that you dropped 
whilst dining aboard a moving ship would fall in your lap rather than falling straight down 
on to your plate. Aristotle's idea had been believed by everyone so Galileo's new proposal 
was extraordinary and, because it was nearly right, became the foundation of physics. 

Galilean Relativity contains two important principles: firstly it is impossible to determine 
who is actually at rest and secondly things continue in uniform motion unless acted upon. 
The second principle is known as Galileo’s Law of Inertia or Newton's First Law of Motion. 

Reference: 

• Galileo Galilei (1632). Dialogues Concerning the Two Chief World Systems 3 . 

• Aristotle (350BC). Physics. http://classics.mit.edu/Aristotle/physics.html 


1 http : //en . Wikipedia. org/wiki/Galileo 

2 http : //en . Wikipedia. org/wiki/Aristotle 

3 http : //en . Wikipedia. org/wiki/Dialogue_Concerning_the_Two_Chief _World_Systems 
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2.1.1 Special relativity 

Until the nineteenth century it appeared that Galilean relativity treated all observers as 
equivalent no matter how fast they were moving. If you throw a ball straight up in the air 
at the North Pole it falls straight back down again and this also happens at the equator 
even though the equator is moving at almost a thousand miles an hour faster than the pole. 
Galilean velocities are additive so that the ball continues moving at a thousand miles an 
hour when it is thrown upwards at the equator and continues with this motion until it is 
acted on by an external agency. 

This simple scheme became questioned in 1865 when James Clerk Maxwell discovered the 
equations that describe the propagation of electromagnetic waves such as light. His equations 
showed that the speed of light depended upon constants that were thought to be simple 
properties of a physical medium or “aether” that pervaded all space. If this were the case 
then, according to Galilean relativity, it should be possible to add your own velocity to the 
velocity of incoming light so that if you were travelling at a half the speed of light then any 
light approaching you would be observed to be travelling at 1.5 times the speed of light in 
the aether. Similarly, any light approaching you from behind would strike you at 0.5 times 
the speed of light in the aether. Light itself would always go at the same speed in the aether 
so if you shone a light from a torch whilst travelling at high speed the light would plop into 
the aether and slow right down to its normal speed. This would spoil Galileo's Relativity 
because all you would need to do to discover whether you were in a moving ship or on dry 
land would be to measure the speed of light in different directions. The light would go slower 
in your direction of travel through the aether and faster in the opposite direction. 

If the Maxwell equations are valid and the simple classical addition of velocities applies 
then there should be a preferred reference frame, the frame of the stationary aether. 
The preferred reference frame would be considered the true zero point to which all velocity 
measurements could be referred. 

Special relativity restored a principle of relativity in physics by maintaining that Maxwell's 
equations are correct but that classical velocity addition is wrong: there is no preferred 
reference frame. Special relativity brought back the interpretation that in all inertial 
reference frames the same physics is going on and there is no phenomenon that would allow 
an observer to pinpoint a zero point of velocity. Einstein 4 preserved the principle of relativity 
by proposing that the laws of physics are the same regardless of the velocity of the observer. 
According to Einstein, whether you are in the hold of Galileo's ship or in the cargo bay of a 
space ship going at a large fraction of the speed of light the laws of physics will be the same. 

Einstein's idea shared the same philosophy as Galileo's idea, both men believed that the 
laws of physics would be unaffected by motion at a constant velocity. In the years between 
Galileo and Einstein it was believed that it was the way velocities simply add to each other 
that preserved the laws of physics but Einstein adapted this simple concept to allow for 
Maxwell's equations. 


4 http : //en . wikibooks . org/wiki/y o 3AWikipediay o 3AAlbert7o20Einstein 
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2.1.2 Frames of reference, events and transformations 

Before proceeding further with the analysis of relative motion the concepts of reference 
frames, events and transformations need to be defined more closely. 

Physical observers are considered to be surrounded by a reference frame which is a set of 
coordinate axes in terms of which position or movement may be specified or with reference 
to which physical laws may be mathematically stated. 

An event is something that happens independently of the reference frame that might be 
used to describe it. Turning on a light or the collision of two objects would constitute an 
event. 

Suppose there is a small event, such as a light being turned on, that is at coordinates x,y,z,t 
in one reference frame. What coordinates would another observer, in another reference 
frame moving relative to the first at velocity v along the x axis assign to the event? This 
problem is illustrated below: 


Transformation of Coordinates 


z 


s 


s 


Event 



The observers are moving at a relative velocity of v and each 
observer has their own set of coordinates (xw^t) and 
(x\y\z\t'). What coordinates do they assign to the event? 

Figure 6 

What we are seeking is the relationship between the second observer's coordinates for the 

event x ,y and the first observer's coordinates for the event x,y,z,t. The coordinates 
refer to the positions and timings of the event that are measured by each observer and, 
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for simplicity, the observers are arranged so that they are coincident at t=0. According to 
Galilean Relativity: 

x =x ~ v t 



t =t 

This set of equations is known as a Galilean coordinate transformation or Galilean 
transformation. 


These equations show how the position of an event in one reference frame is related to the 
position of an event in another reference frame. But what happens if the event is something 
that is moving? How do velocities transform from one frame to another? 

The calculation of velocities depends on Newton's formula: v = dx/dt. The use of Newtonian 
physics to calculate velocities and other physical variables has led to Galilean Relativity 
being called Newtonian Relativity in the case where conclusions are drawn beyond simple 
changes in coordinates. The velocity transformations for the velocities in the three directions 
in space are, according to Galilean relativity: 

U x =Ux-V 

u y=Uy 


/ y,UZ 

Where u x,u are the velocities of a moving object in the three directions in space recorded 
by the second observer and u x ,u y ,u z are the velocities recorded by the first observer and v 
is the relative velocity of the observers. The minus sign in front of the v means the moving 
object is moving away from both observers. 

This result is known as the classical velocity addition theorem and summarises the 
transformation of velocities between two Galilean frames of reference. It means that the 
velocities of projectiles must be determined relative to the velocity of the source and 
destination of the projectile. For example, if a sailor throws a stone at 10 km/hr from 
Galileo's ship which is moving towards shore at 5 km/hr then the stone will be moving at 
15 km/hr when it hits the shore. 

In Newtonian Relativity the geometry of space is assumed to be Euclidean and the measure- 
ment of time is assumed to be the same for all observers. 

The derivation of the classical velocity addition theorem is as follows If the Galilean 
transformations are differentiated with respect to time: 


x'= x ~ vt 


So: 
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/ dt=dx / dt—v 


I , / : id = dx 

But in Galilean relativity t and so dx ' therefore 

d x '/ dt= dx / dt ~ v 
dy'/ dt= d y/ dt 
^ ' /dt= dz / dt 


. / 

t r • / / dt i 

If we write u x ~ etc. then: 


2.1.3 The postulates of special relativity 

In the previous section transformations from one frame of reference to another were described 
using the simple addition of velocities that were introduced in Galileo's time and these 
transformations are consistent with Galileo's main postulate which was that the laws of 
physics would be the same for all inertial observers so that no-one could tell who was at 
rest. Aether theories had threatened Galileo's postulate because the aether would be at rest 
and observers could determine that they were at rest simply by measuring the speed of light 
in the direction of motion. Einstein preserved Galileo's fundamental postulate that the laws 
of physics are the same in all inertial frames of reference but to do so he had to introduce a 
new postulate that the speed of light would be the same for all observers. These postulates 
are listed below: 

1. First postulate: the principle of relativity 

Formally: the laws of physics are the same in all inertial frames of reference. 

Informally: every physical theory should look the same mathematically to every inertial 
observer. Experiments in a physics laboratory in a spaceship or planet orbiting the sun and 
galaxy will give the same results no matter how fast the laboratory is moving. 

2. Second postulate: the invariance of the speed of light 

Formally: the speed of light in free space is a constant in all inertial frames of 
reference. 
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Informally: the speed of light in a vacuum, commonly denoted c, is the same for all inertial 
observers, is the same in all directions, and does not depend on the velocity of the object 
emitting the light. 

Using these postulates Einstein was able to calculate how the observation of events depends 
upon the relative velocity of observers. He was then able to construct a theory of physics 
that led to predictions such as the equivalence of mass and energy and early quantum theory. 

Einstein's formulation of the axioms of relativity is known as the electrodynamic approach 
to relativity. It has been superseded in most advanced textbooks by the “space-time 
approach” in which the laws of physics themselves are due to symmetries in space-time and 
the constancy of the speed of light is a natural consequence of the existence of space-time. 
However, Einstein's approach is equally valid and represents a tour de force of deductive 
reasoning which provided the insights required for the modern treatment of the subject. 


2.1.4 Einstein's Relativity - the electrodynamic approach 

Einstein asked how the lengths and times that are measured by the observers might need to 
vary if both observers found that the speed of light was constant. He looked at the formulae 
for the velocity of light that would be used by the two observers, {x = ct ) and {x =ct , and 
asked what constants would need to be introduced to keep the measurement of the speed 
of light at the same value even though the relative motion of the observers meant that the 
x axis was continually expanding. His working is shown in detail in the appendix 5 . The 
result of this calculation is the Lorentz Transformation Equations: 

x' = y(a: — vt) 

y' = y 


/ 

z 


= z 


t' = 7 (t 


vx 


) 


Where the constant 7 = 


These equations apply to any two observers in relative 


1 -! 


motion but note that the sign within the brackets changes according to the direction of the 
velocity - see the appendix 6 . 


The Lorentz Transformation is the equivalent of the Galilean Transformation with the 
added assumption that everyone measures the same velocity for the speed of light no matter 
how fast they are travelling. The speed of light is a ratio of distance to time (ie: metres 
per second) so for everyone to measure the same value for the speed of light the length of 


5 Chapter 7.3 on page 145 

6 Chapter 7.3 on page 145 
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measuring rods, the length of space between light sources and receivers and the number of 
ticks of clocks must dynamically differ between the observers. So long as lengths and time 
intervals vary with the relative velocity of two observers (v) as described by the Lorentz 
Transformation the observers can both calculate the speed of light as the ratio of the distance 
travelled by a light ray divided by the time taken to travel this distance and get the same 
value. 

Einstein's approach is "electrodynamic'' because it assumes, on the basis of Maxwell's 
equations, that light travels at a constant velocity. As mentioned above, the idea of 
a universal constant velocity is strange because velocity is a ratio of distance to time. 
Do the Lorentz Transformation Equations hide a deeper truth about space and time? 
Einstein himself (Einstein 1920) gives one of the clearest descriptions of how the Lorentz 
Transformation equations are actually describing properties of space and time itself. His 
general reasoning is given below. 

If the equations are combined they satisfy the relation: 

(1) x' 2 - c 2 t 2 = x 2 - c 2 t 2 

Einstein (1920) describes how this can be extended to describe movement in any direction 
in space: 

(2) x 2 + y 2 + z 2 — c 2 t 2 = x 2 + y 2 + z 2 — c 2 t 2 

Equation (2) is a geometrical postulate about the relationship between lengths and times 
in the universe. It suggests that there is a constant s such that: 


2 '2 i '2 , '2 2/2 

s = x +y +z — ct 


2 2 , 2,2 2 j .2 

s = x +y +z — ct 

This equation was recognised by Minkowski as an extension of Pythagoras' Theorem (ie: 
s 2 = x 2 + y 2 ), such extensions being well known in early twentieth century mathematics. 
What the Lorentz Transformation is telling us is that the universe is a four dimensional 
spacetime and as a result there is no need for any "aether". (See Einstein 1920, appendices, 
for Einstein's discussion of how the Lorentz Transformation suggests a four dimensional 
universe but be cautioned that "imaginary time" has now been replaced by the use of "metric 
tensors"). 

Einstein's analysis shows that the x-axis and time axis of two observers in relative motion 
do not overlie each other, The equation relating one observer's time to the other observer's 
time shows that this relationship changes with distance along the x-axis ie: 


t' = j(t 


vx 

^2 


) 


This means that the whole idea of "frames of reference" needs to be re-visited to allow for 
the way that axes no longer overlie each other. 


20 



Inertial reference frames 


Einstein, A. (1920). Relativity. The Special and General Theory. Methuen & Co Ltd 
1920. Written December, 1916. Robert W. Lawson (Authorised translation). http://www. 
bartleby . com/ 173/ 


2.2 Inertial reference frames 


The Lorentz Transformation for time involves a component ( vx/c 2 ) which results in time 
measurements being different along the x-axis of relatively moving observers. This means 
that the old idea of a frame of reference that simply involves three space dimensions with 
a time that is in common between all of the observers no longer applies. To compare 
measurements between observers the concept of a "reference frame" must be extended to 
include the observer's clocks. 

An inertial reference frame is a conceptual, three-dimensional latticework of measuring 
rods set at right angles to each other with clocks at every point that are synchronised with 
each other (see below for a full definition). An object that is part of, or attached to, an 
inertial frame of reference is defined as an object which does not disturb the synchronisation 
of the clocks and remains at a constant spatial position within the reference frame. The 
inertial frame of reference that has a moving, non-rotating body attached to it is known as 
the inertial rest frame for that body. An inertial reference frame that is a rest frame for 
a particular body moves with the body when observed by observers in relative motion. 
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Spatial representation of an 
inertial frame of reference 
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z axis 



Space-time representation of an 
inertial frame of reference 


Figure 7 


This type of reference frame became known as an "inertial" frame of reference because, as will 
be seen later in this book, each system of objects that are co-moving according to Newton's 
law of inertia (without rotation, gravitational fields or forces acting) have a common rest 
frame, with clocks that differ in synchronisation and rods that differ in length, from those in 
other, relatively moving, rest frames. 

There are many other definitions of an "inertial reference frame" but most of these, such 
as "an inertial reference frame is a reference frame in which Newton's First Law is valid" 
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do not provide essential details about how the coordinates are arranged and/or represent 
deductions from more fundamental definitions. 

The following definition by Blandford and Thorne (2004) is a fairly complete summary of 
what working physicists mean by an inertial frame of reference: 

"An inertial reference frame is a (conceptual) three-dimensional latticework of measuring 
rods and clocks with the following properties: (i ) The latticework moves freely through 
spacetime (i.e. , no forces act on it), and is attached to gyroscopes so it does not rotate with 
respect to distant, celestial objects, (ii ) The measuring rods form an orthogonal lattice and 
the length intervals marked on them are uniform when compared to, e.g., the wavelength of 
light emitted by some standard type of atom or molecule; and therefore the rods form an 
orthonormal, Cartesian coordinate system with the coordinate x measured along one axis, y 
along another, and z along the third, (iii ) The clocks are densely packed throughout the 
latticework so that, ideally, there is a separate clock at every lattice point, (iv ) The clocks 
tick uniformly when compared, e.g., to the period of the light emitted by some standard 
type of atom or molecule; i.e., they are ideal clocks, (v) The clocks are synchronized by the 
Einstein synchronization process: If a pulse of light, emitted by one of the clocks, bounces 
off a mirror attached to another and then returns, the time of bounce tb as measured by 
the clock that does the bouncing is the average of the times of emission and reception as 
measured by the emitting and receiving clock: tb = l/2(t e + t r ). 1 

1 For a deeper discussion of the nature of ideal clocks and ideal measuring rods see, e.g., pp. 
23-29 and 395-399 of Misner, Thorne, and Wheeler (1973)." 

Special Relativity demonstrates that the inertial rest frames of objects that are moving 
relative to each other do not overlay one another. Each observer sees the other, moving 
observer's, inertial frame of reference as distorted. This discovery is the essence of Special 
Relativity and means that the transformation of coordinates and other measurements 
between moving observers is complicated. It will be discussed in depth below. 
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Relativit\ T shows that the inertial frames of 
reference of relatively moving objects do not 
overlie each other. 

Figure 8 


Blandford, R.D. and Thorne, K.S.(2004). Applications of Classical Physics. California 
Institute of Technology. See: http://www.pma.caltech.edu/Courses/phl36/yr2004/ 
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3.1 The modern approach to relativity 


Although the special theory of relativity was first proposed by Einstein in 1905, the modern 
approach to the theory depends upon the concept of a four-dimensional universe, that 
was first proposed by Hermann Minkowski in 1908. This approach uses the concept of 
invariance to explore the types of coordinate systems that are required to provide a full 
physical description of the location and extent of things. 

The modern theory of special relativity begins with the concept of "length". In everyday 
experience, it seems that the length of objects remains the same no matter how they are 
rotated or moved from place to place. We think that the simple length of a thing is "invariant". 
However, as is shown in the illustrations below, what we are actually suggesting is that 
length seems to be invariant in a three-dimensional coordinate system. 


Figure 1: Invariance of length on a Euclidean plane. 




The length of a thing in a two dimensional 
coordinate system is given by Pythagoras' 
theorem. h 2 = x 2 + y 2 


In the 2D plane length is invanant 
during rotations on the plane. 

The length is also invanant if the 
thing is just moved from place to 
place (translational invariance). 


If a thing rotates out of the plane 
the length it projects on the plane 
is no longer equal to the real length 
of the thing. 



Figure 9 


The length of a thing in a two-dimensional coordinate system is given by Pythagoras's 
theorem: 


2,2 7 2 

x +y = h 

This two-dimensional length is not invariant if the thing is tilted out of the two-dimensional 
plane. In everyday life, a three-dimensional coordinate system seems to describe the length 
fully. The length is given by the three-dimensional version of Pythagoras's theorem: 
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h 2 


2 , 2 , 2 
x + y + z 


The derivation of this formula is shown in the illustration below. 


Figure 2: Invariance in a 3D Euclidean space. 



The length of an object in a three dimensional 
coordinate system is given by the 3D version of 
Pythagoras' theorem: 


,2 , 2 , 2 , ,2 2 , 2 

k = h + z but h = x + y 

.2 2 2 2 

k = x + y + z 


Figure 10 


Y 



In a three dimensional coordinate system it 
seems that the real length of a thing stays the 
same (is ESTVAPIANT) during translations and 
rotations. It appears to be always given by: 


It seems that, provided all the directions in which a thing can be tilted or arranged are 
represented within a coordinate system, then the coordinate system can fully represent the 
length of a thing. However, it is clear that things may also be changed over a period of time. 
Time is another direction in which things can be arranged. This is shown in the following 
diagram: 
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Figure 3: The invariant space-time interval. 


T - time T - time 



Motions can be represented as lengths spanning both space and time in a coordinate system. These 
lengths are called SPACE-TIME INTERVALS. Time can be considered to be yet another direction 
for arranging things. This suggests that the universe could be four dimensional. If the universe is truly 
four dimensional then space-time intervals would be invariant when things move . 


Figure 11 


The path taken by a thing in both space and time is known as the space-time interval. 

In 1908 Hermann Minkowski pointed out that if things could be rearranged in time, then the 
universe might be four-dimensional. He boldly suggested that Einstein's recently-discovered 
theory of Special Relativity was a consequence of this four- dimensional universe. He proposed 
that the space-time interval might be related to space and time by Pythagoras' theorem in 
four dimensions: 


2 2 , 2 , 2 i /• ,\2 

s = x +y +z + (icb) 

Where i is the imaginary unit 1 (sometimes imprecisely called \J — 1), c is a constant, and t 
is the time interval spanned by the space-time interval, s. The symbols x, y and z represent 
displacements in space along the corresponding axes. In this equation, the 'second' becomes 
just another unit of length. In the same way as centimetres and inches are both units of 
length related by centimetres = 'conversion constant' times inches, metres and seconds 
are related by metres = 'conversion constant' times seconds. The conversion constant, c 
has a value of about 300,000,000 meters per second. Now i 2 is equal to minus one, so the 
space-time interval is given by: 


1 http : //en . Wikipedia . org/wiki/yo20imaginaryy o 20unity o 20 
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2 2 , 2 i 2 / j.\2 

s = x +y + z — ( ct ) 

Minkowski's use of the imaginary unit has been superseded by the use of advanced geometry 
that uses a tool known as the "metric tensor". The metric tensor permits the existence of 
"real" time and the negative sign in the expression for the square of the space-time interval 
originates in the way that distance changes with time when the curvature of spacetime is 
analysed (see advanced text). We now use real time but Minkowski's original equation for 
the square of the interval survives so that the space-time interval is still given by: 

s = x +y + z — (ct) 

Space-time intervals are difficult to imagine; they extend between one place and time and 
another place and time, so the velocity of the thing that travels along the interval is already 
determined for a given observer. 

If the universe is four- dimensional, then the space-time interval will be invariant, rather 
than spatial length. Whoever measures a particular space-time interval will get the same 
value, no matter how fast they are travelling. In physical terminology the invariance of 
the spacetime interval is a type of Lorentz Invariance. The invariance of the spacetime 
interval has some dramatic consequences. 

The first consequence is the prediction that if a thing is travelling at a velocity of c metres 
per second, then all observers, no matter how fast they are travelling, will measure the same 
velocity for the thing. The velocity c will be a universal constant. This is explained below. 

When an object is travelling at c, the space time interval is zero, this is shown below: 

The distance travelled by an object moving at velocity v in the x direction for t seconds is: 


x = vt 

If there is no motion in the y or z directions the space-time interval is s 2 = x 2 + 0 + 0 — (ct) 2 

So: s 2 = (vt.) 2 — (ct) 2 

But when the velocity v equals c: 


s 2 = (ct) 2 — (ct) 2 

And hence the space time interval s 2 = (ct.) 2 — (ct.) 2 = 0 

A space-time interval of zero only occurs when the velocity is c (if x>0). All observers 
observe the same space-time interval so when observers observe something with a space-time 
interval of zero, they all observe it to have a velocity of c, no matter how fast they are 
moving themselves. 

The universal constant, c, is known for historical reasons as the "speed of light in a vacuum". 
In the first decade or two after the formulation of Minkowski's approach many physicists, 
although supporting Special Relativity, expected that light might not travel at exactly c, 
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but might travel at very nearly c. There are now few physicists who believe that light in a 
vacuum does not propagate at c. 

The second consequence of the invariance of the space-time interval is that clocks will appear 
to go slower on objects that are moving relative to you. Suppose there are two people, Bill 
and John, on separate planets that are moving away from each other. John draws a graph 
of Bill's motion through space and time. This is shown in the illustration below: 


Figure 4: John and Bill - observers moving away from each other. 



Time 



Distance 


John's graph of Bill's motion through time and 
space. J ohn thinks Bill is going through both 
time and space but Bill thinks he's only going 
through time. 


Bill thinks his p ath is 
just through time (the 
dotted line) but J ohn 
thinks Bill takes the 
solid line. 


Figure 12 


Being on planets, both Bill and John think they are stationary, and just moving through 
time. John spots that Bill is moving through what John calls space, as well as time, when 
Bill thinks he is moving through time alone. Bill would also draw the same conclusion about 
John's motion. To John, it is as if Bill's time axis is leaning over in the direction of travel 
and to Bill, it is as if John's time axis leans over. 

John calculates the length of Bill's space-time interval as: 

S ' 2 = ( vt ) 2 — (cf ) 2 

whereas Bill doesn't think he has travelled in space, so writes: 

s 2 = (0) 2 -( c r) 2 

The space-time interval, s 2 , is invariant. It has the same value for all observers, no matter 
who measures it or how they are moving in a straight line. Bill's s 2 equals John's s 2 so: 
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(0 ) 2 -{cT) 2 = {vt) 2 -{ct) 2 

and 

-(cT) 2 = (ut) 2 -(ct) 2 

hence 


t = T/^]l-v 2 /c 2 

So, if John sees Bill measure a time interval of 1 second (T = 1) between two ticks of a clock 
that is at rest in Bill's frame, John will find that his own clock measures between these same 
ticks an interval t, called coordinate time, which is greater than one second. It is said 
that clocks in motion slow down, relative to those on observers at rest. This is known as 
"relativistic time dilation of a moving clock". The time that is measured in the rest frame of 
the clock (in Bill's frame) is called the proper time of the clock. 

John will also observe measuring rods at rest on Bill's planet to be shorter than his own 
measuring rods, in the direction of motion. This is a prediction known as "relativistic length 
contraction of a moving rod". If the length of a rod at rest on Bill's planet is X, then we 
call this quantity the proper length of the rod. The length x of that same rod as measured 
on John's planet, is called coordinate length, and given by 


x = X\J 1 — v 2 /c 2 

This equation can be derived directly and validly from the time dilation result with the 
assumption that the speed of light is constant. 

The last consequence is that clocks will appear to be out of phase with each other along the 
length of a moving object. This means that if one observer sets up a line of clocks that are 
all synchronised so they all read the same time, then another observer who is moving along 
the line at high speed will see the clocks all reading different times. In other words observers 
who are moving relative to each other see different events as simultaneous. This effect is 
known as Relativistic Phase or the Relativity of Simultaneity. Relativistic phase is 
often overlooked by students of Special Relativity, but if it is understood then phenomena 
such as the twin paradox are easier to understand. 

The way that clocks go out of phase along the line of travel can be calculated from the 
concepts of the invariance of the space-time interval and length contraction. 
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How clocks become out of phase 
along the line of travel 


Time 



X 


T= Xv/c 2 


Figure 13 


In the diagram above John is conventionally stationary. Distances between two points accord- 
ing to Bill are simple lengths in space (x) all at t=0 whereas John sees Bill's measurement 
of distance as a combination of a distance (X) and a time interval (T): 

x 2 = X 2 - (cT) 2 

But Bill's distance, x, is the length that he would obtain for things that John believes to be 
X metres in length. For Bill it is John who has rods that contract in the direction of motion 
so Bill's determination "x" of John's distance "X" is given from: 


x = X\J 1 — u 2 /c 2 

Thus x 2 = X 2 - (v 2 /c 2 )X 2 
So: (cT) 2 = {v 2 /c 2 )X 2 
And cT=(v/c)X 
So: T = (v/c 2 )X 
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Clocks that are synchronised for one observer go out of phase along the line of travel for 
another observer moving at v metres per second by \{y/c 2 ) seconds for every metre. This is 
one of the most important results of Special Relativity and should be thoroughly understood 
by students. 

The net effect of the four-dimensional universe is that observers who are in motion relative 
to you seem to have time coordinates that lean over in the direction of motion and consider 
things to be simultaneous that are not simultaneous for you. Spatial lengths in the direction 
of travel are shortened, because they tip upwards and downwards, relative to the time axis 
in the direction of travel, akin to a rotation out of three-dimensional space. 


Figure 5: How Bill's coordinates appear to John at 
the instant Bill passes him. 

John's time 
coordinate 



axis) 


axis) 


Figure 14 
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Great care is needed when interpreting space-time diagrams. Diagrams present data in two 
dimensions, and cannot show faithfully how, for instance, a zero length space-time interval 
appears. 

Figure 6: Space-time diagrams are often misleading 


t 



The path of a light ray is a space -time interval of zero but 
appears as a long line in the diagram. 


Figure 15 


It is sometimes mistakenly held that the time dilation and length contraction results only 
apply for observers at x=0 and t=0. This is untrue. An inertial frame of reference is defined 
so that length and time comparisons can be made anywhere within a given reference frame. 
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J ohn 1 s time Bill 1 s time 
axis axis 



Bill s x axis 


John's x axis 


Figure 16 Time dilation applies to time measurements taken between corresponding 
planes of simultaneity 


Time differences in one inertial reference frame can be compared with time differences 
anywhere in another inertial reference frame provided it is remembered that these differences 
apply to corresponding pairs of lines or pairs of planes of simultaneous events. 
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Figure 17 Spacetime diagram showing an event, a world line, and a line of simultaneity. 


In order to gain an understanding of both Galilean and Special Relativity it is important to 
begin thinking of space and time as being different dimensions of a four-dimensional vector 
space called spacetime. Actually, since we can't visualize four dimensions very well, it is 
easiest to start with only one space dimension and the time dimension. The figure shows a 
graph with time plotted on the vertical axis and the one space dimension plotted on the 
horizontal axis. An event is something that occurs at a particular time and a particular 
point in space. ("Julius X. wrecks his car in Lemitar, NM on 21 June at 6:17 PM. 1 ') A 
world line is a plot of the position of some object as a function of time (more properly, the 
time of the object as a function of position) on a spacetime diagram. Thus, a world line is 
really a line in spacetime, while an event is a point in spacetime. A horizontal line parallel 
to the position axis (x-axis) is a line of simultaneity, in Galilean Relativity all events on 
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this line occur simultaneously for all observers. It will be seen that the line of simultaneity 
differs between Galilean and Special Relativity; in Special Relativity the line of simultaneity 
depends on the state of motion of the observer. 

In a spacetime diagram the slope of a world line has a special meaning. Notice that a vertical 
world line means that the object it represents does not move — the velocity is zero. If the 
object moves to the right, then the world line tilts to the right, and the faster it moves, the 
more the world line tilts. Quantitatively, we say that 

velocity = slope of l world Kwe .(5.1) 

Notice that this works for negative slopes and velocities as well as positive ones. If the 
object changes its velocity with time, then the world line is curved, and the instantaneous 
velocity at any time is the inverse of the slope of the tangent to the world line at that time. 

The hardest thing to realize about spacetime diagrams is that they represent the past, 
present, and future all in one diagram. Thus, spacetime diagrams don't change with time — 
the evolution of physical systems is represented by looking at successive horizontal slices in 
the diagram at successive times. Spacetime diagrams represent the evolution of events, but 
they don't evolve themselves. 

3.2 The lightcone 

Things that move at the speed of light in our four dimensional universe have surprising 
properties. If something travels at the speed of light along the x-axis and covers x meters 
from the origin in t seconds the space-time interval of its path is zero. 

s 2 = x 2 — ( ct ) 2 

but x = ct so: 

s 2 = (ct) 2 — (ct) 2 = 0 

Extending this result to the general case, if something travels at the speed of light in any 
direction into or out from the origin it has a space-time interval of 0: 

0 = x 2 + y 2 + z 2 — (ct) 2 

This equation is known as the Minkowski Light Cone Equation. If light were travelling 
towards the origin then the Light Cone Equation would describe the position and time of 
emission of all those photons that could be at the origin at a particular instant. If light 
were travelling away from the origin the equation would describe the position of the photons 
emitted at a particular instant at any future time 't'. 
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A sphere of events 


A Slice of the 
sphere 




The observer is here ■ 

Currently observable 
events are on this 
surface 

Events that were once 
observable are mside this 
volume 




C >s Affective 



Z 

Space -Like 




1 ' 


2 2 2 

A Slice of a Sphere of Events Represented by a Light Cone (Solution of 0 = x + z - (ct) ) 


Figure 18 


At the superficial level the light cone is easy to interpret. Its backward surface represents 
the path of light rays that strike a point observer at an instant and its forward surface 
represents the possible paths of rays emitted from the point observer. Things that travel 
along the surface of the light cone are said to be light- like and the path taken by such 
things is known as a null geodesic. 

Events that lie outside the cones are said to be space-like or, better still space separated 
because their space time interval from the observer has the same sign as space (positive 
according to the convention used here). Events that lie within the cones are said to be 
time-like or time separated because their space-time interval has the same sign as time. 

However, there is more to the light cone than the propagation of light. If the added 
assumption is made that the speed of light is the maximum possible velocity then events 
that are space separated cannot affect the observer directly. Events within the backward 
cone can have affected the observer so the backward cone is known as the "affective past" 
and the observer can affect events in the forward cone hence the forward cone is known as 
the "affective future". 

The assumption that the speed of light is the maximum velocity for all communications is 
neither inherent in nor required by four dimensional geometry although the speed of light is 
indeed the maximum velocity for objects if the principle of causality is to be preserved by 
physical theories (ie: that causes precede effects). 
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3.3 The Lorentz transformation equations 

The discussion so far has involved the comparison of interval measurements (time intervals 
and space intervals) between two observers. The observers might also want to compare 
more general sorts of measurement such as the time and position of a single event that is 
recorded by both of them. The equations that describe how each observer describes the 
other's recordings in this circumstance are known as the Lorentz Transformation Equations. 
(Note that the symbols below signify coordinates.) 

John's time 
coordinate 

I Bill's time coordinate 


Bill's space 
coordinate (x 1 axis) 


John's space 
coordinate (x axis) 

The Lorentz Transformation: Ho w observers with two 
different coordinate systems view the same event. In this 
case John and Bill are in space vehicles travelling past 
each other at 'O' where they synchronise clocks. 

The objective is to calculate what the other 
observer reports as the time and position of the event 

Figure 19 

The table below shows the Lorentz Transformation Equations. 
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3.4 More about the relativity of simultaneity 

Most physical theories assume that it is possible to synchronise clocks. If you set up an 
array of synchronised clocks over a volume of space and take a snapshot of all of them 
simultaneously, you will find that the one closest to you will appear to show a later time 
than the others, due to the time light needs to travel from each of the distant clocks towards 
you. However, if the correct clock positions are known, by taking the transmission time of 
light into account, one can easily compensate for the differences and synchronise the clocks 
properly. The possibility of truly synchronising clocks exists because the speed of light is 
constant and this constant velocity can be used in the synchronisation process (the use of 
the predictable delays when light is used for synchronising clocks is known as "Einstein 
synchronisation " ) . 

The Lorentz transformation for time compares the readings of synchronised clocks at any 
instant. It compares the actual readings on clocks allowing for any time delay due to 
transmitting information between observers and answers the question "what does the other 
observer's clock actually read now, at this moment". The answer to this question is shocking. 
The Lorentz transformation for time shows that the clocks in any frame of reference moving 
relative to you cease to be synchronised! 
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Relativity shows that the inertial frames of 
reference of relatively moving objects do not 
overlie each other. 

Figure 20 


The desynchronisation between relatively moving observers is illustrated below with a simpler 
diagram: 
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The Relativity of Simultaneity and Phase 


t 




Phase describes how events that one 
observer measures to be simultaneous are 
not simultaneous for another observer. 


Figure 21 


The effect of the relativity of simultaneity is for each observer to consider that a different 
set of events is simultaneous. Phase means that observers who are moving relative to each 
other have different sets of things that are simultaneous, or in their "present moment". It is 
this discovery that time is no longer absolute that profoundly unsettles many students of 
relativity. 
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Figure 5: How Bill's coordinates appear to John at 
the instant Bill passes him. 

John's time 
coordinate 



Figure 22 


The amount by which the clocks differ between two observers depends upon the distance of 
the clock from the observer [t = xv/c 2 ). Notice that if both observers are part of inertial 
frames of reference with clocks that are synchronised at every point in space then the phase 
difference can be obtained by simply reading the difference between the clocks at the distant 
point and clocks at the origin. This difference will have the same value for both observers. 
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3.4.1 The Andromeda Paradox 

Relativistic phase differences have the startling consequence that at distances as large as our 
separation from nearby galaxies an observer who is driving on the earth can have a radically 
different set of events in her "present moment" from another person who is standing on the 
earth. The classic example of this effect of phase is the "Andromeda Paradox", also known 
as the "Rietdijk-Putnam-Penrose" argument. Penrose described the argument: 

"Two people pass each other on the street; and according to one of the two people, an 
Andromedean space fleet has already set off on its journey, while to the other, the decision 
as to whether or not the journey will actually take place has not yet been made. How can 
there still be some uncertainty as to the outcome of that decision? If to either person the 
decision has already been made, then surely there cannot be any uncertainty. The launching 
of the space fleet is an inevitability." (Penrose 1989). 

The argument is illustrated below: 

The Andromeda Paradox 



Figure 23 


Notice that neither observer can actually "see" what is happening on Andromeda now. 
The argument is not about what can be "seen", it is purely about what different observers 
consider to be contained in their instantaneous present moment. The two observers observe 
the same, two million year old events in their telescopes but the moving observer must 
assume that events at the present moment on Andromeda are a day or two in advance of 
those in the present moment of the stationary observer. (Incidentally, the two observers see 
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the same events in their telescopes because length contraction of the distance from Earth to 
Andromeda compensates exactly for the time difference on Andromeda.) 

This "paradox" has generated considerable philosophical debate on the nature of time 
and free-will. The advanced text of this book provides a discussion of some of the issues 
surrounding this geometrical interpretation of special relativity. 

A result of the relativity of simultaneity is that if the car driver launches a space rocket 
towards the Andromeda galaxy it might have a several days head start compared with a 
space rocket launched from the ground. This is because the "present moment" for the moving 
car driver is progressively advanced with distance compared with the present moment on 
the ground. The present moment for the car driver is shown in the illustration below: 


Time 



Distance 


For the car driver the stationary man and the invasion fleet are all events in the 
present moment. 



Figure 24 


The net effect of the Andromeda paradox is that when someone is moving towards a distant 
point there are later events at that point than for someone who is not moving towards the 
distant point. There is a time gap between the events in the present moment of the two 
people. 


3.5 The nature of length contraction 


According to special relativity items such as measuring rods consist of events distributed in 
space and time and a three dimensional rod is the events that compose the rod at a single 
instant. However, from the relativity of simultaneity it is evident that two observers in 
relative motion will have different sets of events that are present at a given instant. This 
means that two observers moving relative to each other will usually be observing measuring 
rods that are composed of different sets of events. If the word "rod" means the three 
dimensional form of the object called a rod then these two observers in relative motion 
observe different rods. 

The way that measuring rods differ between observers can be seen by using a Minkowski 
diagram. The area of a Minkowski diagram that corresponds to all of the events that 
compose an object over a period of time is known as the worldtube of the object. It can be 
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seen in the image below that length contraction is the result of individual observers having 
different sections of an object's worldtube in their present instant. 

The Nature of Length Contraction 


t t' 



Figure 25 


(It should be recalled that the longest lengths on space-time diagrams are often the shortest 
in reality). 

It is sometimes said that length contraction occurs because objects rotate into the time axis. 
This is actually a half truth, there is no actual rotation of a three dimensional rod, instead 
the observed three dimensional slice of a four dimensional rod is changed which makes it 
appear as if the rod has rotated into the time axis. In special relativity it is not the rod 
that rotates into time, it is the observer's slice of the worldtube of the rod that rotates. 
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There can be no doubt that the three dimensional slice of the worldtube of a rod does indeed 
have different lengths for relatively moving observers so that the relativistic contraction of 
the rod is a real, physical phenomenon. 

The issue of whether or not the events that compose the worldtube of the rod are always 
existent is a matter for philosophical speculation. 

Further reading: Vesselin Petkov. (2005) Is There an Alternative to the Block Universe 

View? 2 


3.6 More about time dilation 

The term "time dilation" is applied to the way that observers who are moving relative to 
you record fewer clock ticks between events than you. In special relativity this is not due 
to properties of the clocks, such as their mechanisms getting heavier. Indeed, it should 
not even be said that the clocks tick faster or slower because what is truly occurring is 
that the clocks record shorter or longer elapsed times and this recording of elapsed time is 
independent of the mechanism of the clocks. The differences between clock readings are due 
to the clocks traversing shorter or longer distances between events along an observer's path 
through spacetime. This can be seen most clearly by re-examining the Andromeda Paradox. 

Suppose Bill passes Jim at high velocity on the way to Mars. Jim has previously synchronised 
the clocks on Mars with his Earth clocks but for Bill the Martian clocks read times well in 
advance of Jim's. This means that Bill has a head start because his present instant contains 
what Jim considers to be the Martian future. Jim observes that Bill travels through 
both space and time and expresses this observation by saying that Bill's clocks recorded 
fewer ticks than his own. Bill achieves this strange time travel by having what Jim considers 
to be the future of distant objects in his present moment. Bill is literally travelling into 
future parts of Jim's frame of reference. 

In special relativity time dilation and length contraction are not material effects, they are 
physical effects due to travel within a four dimensional spacetime. The mechanisms of the 
clocks and the structures of measuring rods are irrelevant. 

It is important for advanced students to be aware that special relativity and General Relativity 
differ about the nature of spacetime. General Relativity, in the form championed by Einstein, 
avoids the idea of extended space and time and is what is known as a "relationalist" theory 
of physics. Special relativity, on the other hand, is a theory where extended spacetime 
is pre-eminent. The brilliant flowering of physical theory in the early twentieth century 
has tended to obscure this difference because, within a decade, special relativity had been 
subsumed within General Relativity. The interpretation of special relativity that is presented 
here should be learnt before advancing to more advanced interpretations. 


2 http : //philsci- archive .pitt . edu/archive/00002408/ 
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3.7 The twin paradox 

In the twin paradox there are twins, Bill and Jim. Jim is on Earth. Bill flies past Jim in a 
spaceship, goes to a distant point such as Mars, turns round and flies back again. It is found 
that Bill records fewer clock ticks over the whole journey than Jim records on earth. Why? 

The twin paradox seems to cause students more problems than almost any other area of 
special relativity. Students sometimes reason that "all motion is relative" and time dilation 
applies so wonder why, if Jim records 25 seconds for the journey and sees Bill's clocks read 
15 seconds, Bill doesn't reciprocally see Jim's clocks read only 9 seconds? This mistake 
arises for two reasons. Firstly, relativity does not hold that "all motion is relative;" this 
is not a postulate of the theory. Secondly, Bill moves through space, so the effects of the 
relativity of simultaneity must be considered as well as time dilation. The analysis given 
below follows Bohm's approach (see "further reading" below). It demonstrates that the twin 
"paradox", or more correctly, the way that the twin's clocks read different elapsed times, is 
due in large part to the relativity of simultaneity. 

The effects of the relativity of simultaneity such as are seen in the "Andromeda paradox" 
are, in part, the origin of the "twin paradox". If you have not understood the Andromeda 
Paradox you will not understand the twin paradox because it will not be obvious that the 
twin who turns round has a head start. The relativity of simultaneity means that if Bill 
flies past Jim in the direction of Mars then Bill finds that any of Jim's clocks on Mars will 
already be reading a time that is in Jim's future. Bill gets a head start on the journey 
because for him Mars is already in Jim's future. Examine the diagram below, the x' axis 
connects all the events that Bill considers to be in his present moment, notice that these 
events get ever further into Jim's future with distance. Bill is flying to a Mars that is already 
in Jim's future. If you understand this then you will understand the twin paradox. 
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The Relativity of Simultaneity and Phase 




Phase describes how events that one 
observer measures to be simultaneous are 
not simultaneous for another observer. 


Figure 26 


The analysis of the twin paradox begins with Jim and Bill synchronising clocks in their 
frames of reference. Jim synchronises his clocks on Earth with those on Mars. As Bill flies 
past Jim he synchronises his clock with Jim's clock on Earth. When he does this he realises 
that the relativity of simultaneity applies and so, for Bill, Jim's clocks on Mars are not 
synchronised with either his own or Jim's clocks on Earth. There is a time difference, or 
"gap", between Bill's clocks and those on Mars even when he passes Jim. This difference is 
equal to the relativistic phase at the distant point. This set of events is almost identical to 
the set of events that were discussed above in the Andromeda Paradox. This is the most 
crucial part of understanding the twin paradox: to Bill the clocks that Jim has placed on 
Mars are already in Jim's future even as Bill passes Jim on Earth. 

Bill flies to Mars and discovers that the clocks there are reading a later time than his own 
clock. He turns round to fly back to Earth and realises that the relativity of simultaneity 
means that, for Bill, the clocks on Earth will have jumped forward and are ahead of those 
on Mars, yet another "time gap" appears. When Bill gets back to Earth the time gaps and 
time dilations mean that people on Earth have recorded more clock ticks that he did. 

In essence the twin paradox is equivalent to two Andromeda paradoxes, one for the outbound 
journey and one for the inbound journey with the added spice of actually visiting the distant 
points. 

For ease of calculation suppose that Bill is moving at a truly astonishing velocity of 0.8c in 
the direction of a distant point that is 10 light seconds away (about 3 million kilometres). 
The illustration below shows Jim and Bill's observations: 


48 


The twin paradox 


The Time Gap Explanation of the Twin 'Paradox 


The outward journey 



The return journey 



10 light secs distance for Jim 
6 light secs distance for Bill 


Bill travels 10 light secs at 0.8c so takes 
12.5 secs to get to the destination according 
to Jim. 

Bill observes the distance to be 6 light 
secs ie: | T 

10 Jl - 0.8^ 

Bill takes 7.5 secs to make the journey 
according to his own clocks, ie: x/v = 6/0.8 

Bill observes ONLY 4.5 secs elapsed time on 
the clock at the destination. 

The other 8 secs on the clock are due to 
the initial time gap. Even when Bill is next to 
Jim he sees that the destination is already 8 
secs into Jim's future. 


The overall journey 



6 liqht secs distance for Bill 


The time dilation in the twin paradox is symmetrical. Jim observes Bill's clocks go slow, reading only 15 secs change on 
Bill's clocks when his own read that 25 secs have passed. Bill observes only 9 secs elapse on Jim's clocks when his 
own show 15 secs have passed. However Bill also observes Jim's clocks jump by 16 secs as a result of phase making a 
total 25 secs for Jim's clocks. 


Figure 27 


From Bill's viewpoint there is both a time dilation and a phase effect. It is the added factor 
of "phase" that explains why, although the time dilation occurs for both observers, Bill 
observes the same readings on Jim's clocks over the whole journey as does Jim. 

To summarise the mathematics of the twin paradox using the example: 

Jim observes the distance as 10 light seconds and the distant point is in his frame of reference. 
According to Jim it takes Bill the following time to make the journey: 

Time taken = distance / velocity therefore according to Jim: 

t = 10/0.8 = 12.5 seconds 

Again according to Jim, time dilation should affect the observed time on Bill's clocks: 

T = tx a/1 — v 2 /c 2 so: 

T = 12.5 x a/ 1-0.8 2 = 7.5 seconds 

So for Jim the round trip takes 25 secs and Bill's clock reads 15 secs. 
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Bill measures the distance as: 

X = x x \J\ — v 2 /c 2 = 10 x Vl — 0.8 2 = 6 light seconds. 

For Bill it takes X/v = 6/0.8 = 7.5 seconds. 

Bill observes Jim's clocks to appear to run slow as a result of time dilation: 
so: 

^'=7.5x\/ 1-0.8 2 =4.5 seconc l s 
But there is also a time gap of vx/c 2 = 8 seconds. 

So for Bill, Jim's clocks register 12.5 secs have passed from the start to the distant point. 
This is composed of 4.5 secs elapsing on Jim's clocks at the turn round point plus an 8 sec 
time gap from the start of the journey. Bill sees 25 secs total time recorded on Jim's clocks 
over the whole journey, this is the same time as Jim observes on his own clocks. 

It is sometimes dubiously asserted that the twin paradox is about the clocks on the twin 
that leaves earth being slower than those on the twin that stays at home, it is then argued 
that biological processes contain clocks therefore the twin that travelled away ages less. This 
is not really true because the relativistic phase plays a major role in the twin paradox and 
leads to Bill travelling to a remote place that, for Bill, is at a later time than Jim when Bill 
and Jim pass each other. A more accurate explanation is that when we travel we travel in 
time as well as space. 

Students have difficulty with the twin paradox because they believe that the observations of 
the twins are symmetrical. This is not the case. As can be seen from the illustration below 
either twin could determine whether they had made the turn or the other twin had made 
the turn. 
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The twin "paradox" is NOT symmetrical 




Two space ships have pulsatile ion drives that leave clouds of ions behind that 
glow in space. When they observe their trails it is clear who has followed which 
path. (They could also use accelerometers to make the same determination). 

Figure 28 


The twin paradox can also be analysed without including any turnaround by Bill. Suppose 
that when Bill passes Mars he meets another traveller coming towards Earth. If the two 
travellers synchronise clocks as they pass each other they will obtain the same elapsed times 
for the whole journey to Mars and back as Bill would have recorded himself. This shows 
that the "paradox" is independent of any acceleration effects at the turnaround point. 


3.7.1 Jim and Bill's view of the journey 

Special relativity does not postulate that all motion is 'relative'; the postulates are that the 
laws of physics are the same in all inertial frames and there is a constant velocity called the 
"speed of light". Contrary to popular myth the twins do not observe events that are a mirror 
image of each other. Bill observes himself leave Jim then return, Jim sees Bill leave him 
then return. Bill does not observe Jim turn round, he observes himself making the turn. 

The following illustrations cover various views of the journey. The most important moment 
in the journey is the point where Bill turns round. Notice how Bill's surface of simultaneity, 
that includes the events that he considers to be in the present moment, swings across Jim's 
worldline during the turn. 
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When Bill turns round what he considers 
to be the "present moment" changes 
dramatically. His plane of simultaneity 
swings abruptly from one set of points in 
Jim's frame of reference to another. 


As Bill travels away from Jim he considers events that are already in Jim's past to be in his 
own present. 


Jim and Bill's view of the outbound journey 

t t 1 



After the turn Bill considers events that are in Jim's future to be in his present (although 
the finite speed of light prevents Bill from observing Jim's future). 
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Jim and Bill's view of the inbound journey 


t t 1 



The swing in Bill's surface of simultaneity at the turn-round point leads to a 'time gap'. In 
our example Bill might surmise that Jim's clocks jump by 16 seconds on the turn. 

Jim and Bill's view of the turn - the time gap 


Jinn's 

future, now 
Bill's past 

Jinn's 
present “ 


Bill's path 



Jim's view 


Jim's 
apparent path 


Bill's 

present 



4.5secs 


4.5secs 


This period of Jim's time 
occurs instantaneously 
for Bill. 


How we would imagine Bill's 
view if the twin paradox 
were symmetrical. 


Figure 32 


Notice that the term "Jim's apparent path" is used in the illustration - as was seen earlier, 
Bill knows that he himself has left Jim and returned so he knows that Jim's apparent path 
is an artefact of his own motion. If we imagine that the twin paradox is symmetrical then 
the illustration above shows how we might imagine Bill would view the journey. But what 
happens, in our example, to the 16 seconds in the time gap, does it just disappear? The 
twin paradox is not symmetrical and Jim does not make a sudden turn after 4.5 seconds. 
Bill's actual observation and the fate of the information in the time gap can be probed by 
supposing that Jim emits a pulse of light several times a second. The result is shown in the 
illustration below. 
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Jim and Bill's view 


t' 

If Jim emits flashes of light 
at fixed intervals it can be 
seen that; if Bill is travelling 
at 0 * 8 C; most of the flashes 
are observed by Bill on the 
inbound journey. 

No flashes are lost. 

i 

Bill must assume that it is 
he who turned around; this 
being the only explanation 
that is consistent with the 
pattern of flashes. 


I 

I 

I 

i 




Jim's idea of where Bill's observation 

the flashes go. of the flashes 


The turn results in a sudden 
realignment of Bill and Jim's 
time axes. 



Figure 33 


Jim has clearly but one inertial frame but does Bill represent a single inertial frame? Suppose 
Bill was on a planet as he passed Jim and flew back to Jim in a rocket from the turn-round 
point: how many inertial frames would be involved? Is Bill's view a view from a single 
inertial frame? 

Exercise: it is interesting to calculate the observations made by an observer who continues 
in the direction of the outward leg of Bill's journey - note that a velocity transformation 
will be needed to estimate Bill's inbound velocity as measured by this third observer. 

Further reading: 

Bohm, D. The Special Theory of Relativity (W. A. Benjamin, 1965). 

D’Inverno, R. Introducing Einstein’s Relativity (Oxford University Press, 1992). 

Eagle, A. A note on Dolby and Gull on radar time and the twin "paradox". American 
Journal of Physics. 2005, VOL 73; NUMB 10, pages 976-978. http://arxiv.org/PS_ 
cache/physics/pdf /041 1/041 1008v2 .pdf 


3.8 The Pole-barn paradox 


The length contraction in relativity is symmetrical. When two observers in relative motion 
pass each other they both measure a contraction of length. 
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Length contraction as a result of the Length contraction is symmetrical 

relativity of simultaneity 


Ti ™ e Time' 

Distance ' 

Distance 

kv/c metres 

An observer picks up one of two identical rods and 
travels back past the other rod at high velocity (v). 

2 2 2 

Using the Minkowskian metric: x = x (xv/c) 

I 2 2 ~ 

Therefore: x' = x /I - v /c 



Time Time ' 



Distance ' 


Distance 


The observer moving in the primed frame observes the 
REST length of his moving rod so gets the same contraction 
as the observer in the unprimed assigns when observing 
the rod in the primed frame. 


Figure 34 


(Note that Minkowski's metric involves the subtraction of displacements in time, so what 
appear to be the longest lengths on a 2D sheet of paper are often the shortest lengths in a 
(3+l)D reality). 

The symmetry of length contraction leads to two questions. Firstly, how can a succession 
of events be observed as simultaneous events by another observer? This question led to 
the concept of de Broglie waves and quantum theory. Secondly, if a rod is simultaneously 
between two points in one frame how can it be observed as being successively between those 
points in another frame? For instance, if a pole enters a building at high speed how can one 
observer find it is fully within the building and another find that the two ends of the rod 
are opposed to the two ends of the building at successive times? What happens if the rod 
hits the end of the building? The second question is known as the "pole- barn paradox" or 
"ladder paradox". 
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Pole-barn paradox 


v t 



In the frame of the barn the pole is simultaneously at the rear of 
the barn and the door. In the frame of the pole these events occur 
sequentially. 


Figure 35 


The pole-barn paradox states the following: suppose a superhero running at 0.75c and 
carrying a horizontal pole 15 m long towards a barn 10m long, with front and rear doors. 
When the runner and the pole are inside the barn, a ground observer closes and then opens 
both doors (by remote control) so that the runner and pole are momentarily captured inside 
the barn and then proceed to exit the barn from the back door. 

One may be surprised to see a 15-rn pole fit inside a 10-rn barn. But the pole is in motion 
with respect to the ground observer, who measures the pole to be contracted to a length of 
9.9 m (check using equations). 

The “paradox” arises when we consider the runner’s point of view. The runner sees the 
barn contracted to 6.6 m. Because the pole is in the rest frame of the runner, the runner 
measures it to have its proper length of 15 nr. Now, how can our superhero make it safely 
through the barn? 

The resolution of the “paradox” lies in the relativity of simultaneity. The closing of the two 
doors is measured to be simultaneous by the ground observer. However, since the doors are 
at different positions, the runner says that they do not close simultaneously. The rear door 
closes and then opens first, allowing the leading edge of the pole to exit. The front door of 
the barn does not close until the trailing edge of the pole passes by. 
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What happens if the rear door is kept closed and made out of some impenetrable material? 
Can we or can we not trap the rod inside the barn by closing the front door while the whole 
rod is inside according to a ground observer? When the front end of the rod hits the rear 
door, information about this impact will travel backwards along the rod in the form of a 
shock wave. The information cannot travel faster than c, so the rear end of the rod will 
continue to travel forward at its original speed until the wave reaches it. Even if the shock 
wave is traveling at the speed of light, it will not reach the rear end of the rod until after 
the rear end has passed through the front door even in the runner's frame. Therefore the 
whole rod (albeit quite scrunched up) will be inside the barn when the front door closes. If 
it is infinitely elastic, it will end up compressed and "spring loaded" against the inside of the 
closed barn. 

3.8.1 Evidence for length contraction, the field of an infinite straight 
current 

Length contraction can be directly observed in the field of an infinitely straight current. 
This is shown in the illustration below. 





Negative charges that are moving away from 
Bill at velocity v but stationary to Jim 






oooooooooooooooooooooooooooooooooooooo 

ooooooooooooooooooooooooooooooooooooo 


Stationary positive charges 


Bill 


Figure 36 


Non-relativistic electromagnetism describes the electric field due to a charge using: 

E =v^- 

Zireor 

and describes the magnetic field due to an infinitely long straight current using the Biot 
Savart law: 

R __ M Of 
2nr 

Or using the charge density (from I = Xv where A ): 

TD MO XV 

2nr 

Using relativity it is possible to show that the formula for the magnetic field given above 
can be derived using the relativistic effect of length contraction on the electric field and so 
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what we call the "magnetic" field can be understood as relativistic observations of a single 
phenomenon. The relativistic calculation is given below. 

If Jim is moving relative to the wire at the same velocity as the negative charges he sees the 
wire contracted relative to Bill: 


L |_ = lyjl — v 2 /c 2 

Bill should see the space between the charges that are moving along the wire to be contracted 
by the same amount but the requirement for electrical neutrality means that the moving 
charges will be spread out to match those in the frame of the fixed charges in the wire. 


This means that Jim sees the negative charges spread out so that: 
l_ = 1 

sj 1 — V 2 /c 2 

The net charge density observed by Jim is: 


A 



_ 2 _ 

4 


Substituting: 




Using the binomial expansion: 


A = f(l 


V 2 1 _ V 2 s i 

2c? 1 2c? 1 


Therefore, allowing for a net positive charge, the positive charges being fixed: 



The electric field at Jim's position is given by: 

^7 Xv 2 

2neo rc 2 

The force due to the electrical field at Jim's position is given by F = Eq which is: 


F = 


q\v 2 
2neorc 2 


Now, from classical electromagnetism: 


c 


2 


1 

£0M0 


So substituting this into F 


(1 )F 


_ qi-LQ\v 2 
2n r 


qXv 2 . 
2rceorc 2 ' 


This is the formula for the relativistic electric force that is observed by Bill as a magnetic 
force. How does this compare with the non-relativistic calculation of the magnetic force? 
The force on a charge at Jim's position due to the magnetic held is, from the classical 
formula: 


F = Bqv 

Which from the Biot-Savart law is: 

(2)F=«4 
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which shows that the same formula applies for the relativistic excess electrical force experi- 
enced by Jim as the formula for the classical magnetic force. 

It can be seen that once the idea of space-time is understood the unification of the two fields 
is straightforward. Jim is moving relative to the wire at the same speed as the negatively 
charged current carriers so Jim only experiences an electric field. Bill is stationary relative 
to the wire and observes that the charges in the wire are balanced whereas Jim observes an 
imbalance of charge. Bill assigns the attraction between Jim and the current carriers to a 
"magnetic field". 

It is important to notice that, in common with the explanation of length contraction given 
above, the events that constitute the stream of negative charges for Jim are not the same 
events as constitute the stream of negative charges for Bill. Bill and Jim's negative charges 
occupy different moments in time. 

Incidently, the drift velocity of electrons in a wire is about a millimetre per second but a 
huge charge is available in a wire (See link below). 

Further reading: 

Purcell, E. M. Electricity and Magnetism. Berkeley Physics Course. Vol. 2. 2nd ed. New 
York, NY: McGraw-Hill. 1984. ISBN: 0070049084. 

Useful links: 

http : //hyperphysics . phy-astr . gsu . edu/hbase/electric/ohmmic . html 
http : //hyperphysics . phy-astr . gsu . edu/hbase/relat iv/releng . html 

3.8.2 De Broglie waves 

De Broglie noticed that the differing three dimensional sections of the universe would cause 
oscillations in the rest frame of an observer to appear as wave trains in the rest frame of 
observers who are moving. 
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De Broglie Waves 


A line of events is on for a moment 


Then goes off 


Then on again 

= 

The flashing line can be represented as a 
worldtube. 



x 


t f 



When a rapidly flashing 
source moves it looks like 
a succession of waves in 
space. 


Figure 37 


He combined this insight with Einstein's ideas on the quantisation of energy to create the 
foundations of quantum theory. De Broglie's insight is also a round-about proof of the 
description of length contraction given above - observers in relative motion have differing 
three dimensional slices of a four dimensional universe. The existence of matter waves is 
direct experimental evidence of the relativity of simultaneity. 
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De Broglie waves 

De Broglie waves result from the differing planes of simultaneity for an 
observer and a moving particle field. 


Observer's Time 




• x axis 


A De Broglie wave is an oscillation occuring 
simultaneously in the rest frame of the 
particle but not simultaneously in the frame 
of the observer. 


Time 



The observer sees the 
oscillation as waves 
distributed in space 


Note: the plane of simultaneity of the observer is shown schematically above. 



\ 

\ 

\ 



The Heisenberg uncertainty principle can be 
derived from the fourier transfrom of the pulse ie: 

B 

\p{x) = $g{k)cos{kx)dk 
0 

Mai / (2M) 


The effect is to produce a spatially 
distributed pulse 


MMal/2. \>\\ik =2x! A =2x p! h =p ) h, 
ApAxxhl 2 


Figure 38 


Further reading: de Broglie, L. (1925) On the theory of quanta. A 

translation of : RECHERCHES SUR LA THEORIE DES QUANTA (Ann. 

de Phys., lOe s'erie, t. Ill (Janvier-F 'evrier 1925). by: A. F. Krack- 

lauer. http : //replay .web . archive . org/20090509012910/http : //www. ensmp . f r/af lb/ 
LDB-oeuvres/De_Broglie_Kracklauer .pdf 


3.8.3 Bell's spaceship paradox 

Bell devised a thought experiment called the "Spaceship Paradox" to enquire whether length 
contraction involved a force and whether this contraction was a contraction of space. In 
the Spaceship Paradox two spaceships are connected by a thin, stiff string and are both 
equally and linearly accelerated to a velocity v relative to the ground, at which, in the 
special relativity version of the paradox, the acceleration ceases. The acceleration on both 
spaceships is arranged to be equal according to ground observers so, according to observers 
on the ground, the spaceships will stay the same distance apart. It is asked whether the 
string would break. 
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It is useful when considering this problem to investigate what happens to a single spaceship. 
If a spaceship that has rear thrusters is accelerated linearly, according to ground observers, 
to a velocity v then the ground observers will observe it to have contracted in the direction 
of motion. The acceleration experienced by the front of the spaceship will have been slightly 
less than the acceleration experienced by the rear of the spaceship during contraction and 
then would suddenly reach a high value, equalising the front and rear velocities, once the rear 
acceleration and increasing contraction had ceased. From the ground it would be observed 
that overall the acceleration at the rear could be linear but the acceleration at the front 
would be non-linear. 

In Bell's thought experiment both spaceships are artificially constrained to have constant 
acceleration, according to the ground observers, until the acceleration ceases. Sudden 
adjustments are not allowed. Furthermore no difference between the accelerations at the 
front and rear of the assembly are permitted so any tendency towards contraction would 
need to be borne as tension and extension in the string. 

The most interesting part of the paradox is what happens to the space between the ships. 
From the ground the spaceships will stay the same distance apart (the experiment is arranged 
to achieve this) whilst according to observers on the spaceships they will appear to become 
increasingly separated. This implies that acceleration is not invariant between reference 
frames (see Part II) and the force applied to the spaceships will indeed be affected by the 
difference in separation of the ships observed by each frame. 

The section on the nature of length contraction above shows that as the string changes 
velocity the observers on the ground observe a changing set of events that compose the 
string. These new events define a string that is shorter than the original. This means that 
the string will indeed attempt to contract as observed from the ground and will be drawn 
out under tension as observed from the spaceships. If the string were unable to bear the 
extension and tension in the moving frame or the tension in the rest frame it would break. 

Another interesting aspect of Bell's Spaceship Paradox is that in the inertial frames of the 
ships, owing to the relativity of simultaneity, the lead spaceship will always be moving 
slightly faster than the rear spaceship so the spaceship-string system does not form a true 
inertial frame of reference until the acceleration ceases in the frames of reference of both 
ships. The asynchrony of the cessation of acceleration shows that the lead ship reaches the 
final velocity before the rear ship in the frame of reference of either ship. However, this time 
difference is very slight (less than the time taken for an influence to travel down the string 
at the speed of light x/c> vx/c 2 ). 

It is necessary at this stage to give a warning about extrapolating special relativity into the 
domain of general relativity (GR). SR cannot be applied with confidence to accelerating 
systems which is why the comments above have been confined to qualitative observations. 

Further reading 

Bell, J. S. (1976). Speakable and unspeakable in quantum mechanics. Cambridge University 
Press 1987 ISBN 0-521-52338-9 

Hsu, J-P and Suzuki, N. (2005) Extended Lorentz Transformations for Accelerated Frames 
and the Solution of the “Two- Spaceship Paradox” AAPPS Bulletin October 2005 p.17 
http : //www . aapps . org/archive/bulletin/voll5/15-5/15_5_pl7p21°/ 0 7F . pdf 
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Matsuda, T and Kinoshita, A (2004. A Paradox of Two Space Ships in Special Relativity. 
AAPPS Bulletin February 2004 p3. http://www.aapps.org/archive/bulletin/voll4/ 
14_1/ 14_l_p03p07 . pdf 


3.9 The transverse doppler effect 


The existence of time dilation means that the frequency of light emitted from a source that 
is moving towards or away from an observer should be red shifted in directions that are 
perpendicular to the direction of motion. The transverse doppler effect is given by: 


/ 

v = v 



Where u is the observed frequency and v' is the frequency if the source were stationary 
relative to the observer (the proper frequency). 

This effect was first confirmed by Ives and Stillwell in 1938. The transverse doppler effect 
is a purely relativistic effect and has been used as an example of proof that time dilation 
occurs. 


3.10 Relativistic transformation of angles 


If a rod makes an angle with its direction of motion toward or away from an observer the 
component of its length in the direction of motion will be contracted. This means that 
observed angles are also transformed during changes of frames of reference. Assuming that 
motion occurs along the x-axis, suppose the rod has a proper length (rest length) of l! 
metres and makes an angle of #'degrees with the x'-axis in its rest frame. The tangent of 
the angle made with the axes is: 

Jj 

Tangent in rest frame of rod = tan O' = r J 

Tangent in observer's frame = tan# = ^ 

Therefore: 

tan# L y L' x 
tan#' L' y L x 

But L x = L' x i/l — w 2 /c 2 
And L y = L' y 
So 

tan #' 
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Showing that angles with the direction of motion are observed to increase with velocity. 

The angle made by a moving object with the x-axis also involves a transformation of velocities 
to calculate the correct angle of incidence. 


3.11 Addition of velocities 


How can two observers, moving at v m/sec relative to each other, compare their observations 
of the velocity of a third object? 


Addition of velocities 


Time Time 1 



The velocity for one observer is given by: 
u 1 = x 1 / t 1 

For the other observer it is given by: 
u — x / t 

Probem: find u in terms of u 1 and the relative 
velocity of the observers, v. 


Figure 39 
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Suppose one of the observers measures the velocity of the object as u where: 
/ 

/ x 

U - t' 

The coordinates x and t are given by the Lorentz transformations: 

/ x — vt 

X a/ (i— i> 2 /c 2 ) 

and 

/ t — (v/c 2 )x 

t V (l-v 2 /c 2 ) 

but 



so: 

/ t — (v/c 2 )x 

X y / (1 — V^/c^) 

l—v 2 /c 2 ) 

and hence: 

X — vt = u( t ~ vx / < ?) 

Notice the role of the phase term vx/c 2 . The equation can be rearranged as: 

, _ (»'+”) 

X ~ (i+u v / c2 H 

given that x = ut: 

_ (u +v \ 

U — (l+u'v/c 2 ) 

This is known as the relativistic velocity addition theorem, it applies to velocities 
parallel to the direction of mutual motion. 

The existence of time dilation means that even when objects are moving perpendicular 
to the direction of motion there is a discrepancy between the velocities reported for an 
object by observers who are moving relative to each other. If there is any component of 
velocity in the x direction ( u x , u x ) then the phase affects time measurement and hence the 
velocities perpendicular to the x-axis. The table below summarises the relativistic addition 
of velocities in the various directions in space. 


3.12 Introduction 

The way that the velocity of a particle can differ between observers who are moving relative 
to each other means that momentum needs to be redefined as a result of relativity theory. 

The illustration below shows a typical collision of two particles. In the right hand frame the 
collision is observed from the viewpoint of someone moving at the same velocity as one of 
the particles, in the left hand frame it is observed by someone moving at a velocity that is 
intermediate between those of the particles. 
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Relativistic particle collisions 



View of observer who is travelling at a 
velocity intermediate between that of the 
balls. (Red = +v Blue = -v Observer =0 m/s) 


V 



View of observer travelling at same velocity as the 
blue ball. 


Figure 40 


If momentum is redefined then all the variables such as force (rate of change of momentum), 
energy etc. will become redefined and relativity will lead to an entirely new physics. The new 
physics has an effect at the ordinary level of experience through the relation K = r ymc 2 — me 2 
whereby it is the tiny deviations in gamma from unity that are expressed as everyday kinetic 
energy so that the whole of physics is related to "relativistic" reasoning rather than Newton's 
empirical ideas. 


3.13 Momentum 

In physics momentum is conserved within a closed system, the law of conservation of 
momentum applies. Consider the special case of identical particles colliding symmetrically 
as illustrated below: 
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A symmetrical Newtonian collision 



Figure 41 

The momentum change by the red ball is: 

2mu y R 

The momentum change by the blue ball is: 

— 2mu y B 

The sum of the momentum changes is zero, so the Newtonian conservation of momentum 
law is demonstrated: 

2mu y R — 2mu y B = 0 

Notice that this result depends upon the y components of the velocities being equal, that is, 

UyR = UyR. 

The relativistic case is rather different. The collision is illustrated below, the left hand frame 
shows the collision as it appears for one observer and the right hand frame shows exactly 
the same collision as it appears for another observer moving at the same velocity as the 
blue ball: 
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u y r 



View of observer who is travelling at a View of observer travelling at same velocity as the 

velocity intermediate between that of the blue ball, 
balls. (Red = +v Blue = -v Observer =0 m/s) 

The uppercase letters (U) represent velocities after the collision. 


Figure 42 


The configuration shown above has been simplified because one frame contains a stationary 
blue ball (ie: u x b = 0) and the velocities are chosen so that the vertical velocity of the 

' =—u B 

red ball is exactly reversed after the collision ie:u yR v . Both frames show exactly the 
same event, it is only the observers who differ between frames. The relativistic velocity 
transformations between frames is: 

l_ u yRV 1 - v 2 /c 2 

1 ~' u xR v / c2 

U yR 


l_ u yB 

1 

l yB 


yj l — v^/c^ 
iB v /c 2 


■UyBy/l-v 2 /c 2 

given that u x b = 0 . 


Suppose that the y components are equal in one frame, in Newtonian physics they will also 
be equal in the other frame. However, in relativity, if the y components are equal in one 
frame they are not necessarily equal in the other frame (time dilation is not directional so 

'=u 

perpendicular velocities differ between the observers). For instance if u yR yB then: 

7/ D = UyR 

yB l-U xR v/c 2 


So if u yR yB then in this case u v r / u v b ■ 


If the mass were constant between collisions and between frames then although 
2mu yR = 2mu yB it is found that: 


2/nUyR / 2?7lUyB 


So momentum defined as mass times velocity is not conserved in a collision when the collision 
is described in frames moving relative to each other. Notice that the discrepancy is very 
small if u x r and v are small. 


To preserve the principle of momentum conservation in all inertial reference frames, the 
definition of momentum has to be changed. The new definition must reduce to the Newtonian 
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expression when objects move at speeds much smaller than the speed of light, so as to 
recover the Newtonian formulas. 


The velocities in the y direction are related by the following equation when the observer is 
travelling at the same velocity as the blue ball ie: when u x b = 0 : 

a d = UyR 
yB l-u xR v/c 2 

If we write tub for the mass of the blue ball) and rriji for the mass of the red ball as observed 
from the frame of the blue ball then, if the principle of relativity applies: 

2 m.RUy R = 2m B u y B 

So: 


Uy B 


But: 


UyB l-u xR v/c 2 

Therefore: 


rriR = 


niB 

l-u xR v/c 2 


This means that, if the principle of relativity is to apply then the mass must change by the 
amount shown in the equation above for the conservation of momentum law to be true. 

, ' 

=—U b ' — 

The reference frame was chosen so that u yR v and hence u~ R . This allows v to be 
determined in terms of u x r : 


/_ u xR~ v 
1 ~ u xR y / c ‘ 2 

U xR 


=v 


and hence: 


v =£^( i ~J i ~ u Ir/ c2 ) 


So substituting for v in rriR = 1 _ : 


mp = Sfl 

\/ 1 - u Ir / c ' 2 


The blue ball is at rest so its mass is sometimes known as its rest mass, and is given the 
symbol m. As the balls were identical at the start of the boost the mass of the red ball 
is the mass that a blue ball would have if it were in motion relative to an observer; this 
mass is sometimes known as the relativistic mass symbolised by M . These terms are 
now infrequently used in modern physics, as will be explained at the end of this section. 
The discussion given above was related to the relative motions of the blue and red balls, 
as a result u x r corresponds to the speed of the moving ball relative to an observer who 
is stationary with respect to the blue ball. These considerations mean that the relativistic 
mass is given by: 


M = 


m 

yjl — lp/c 2 


The relativistic momentum is given by the product of the relativistic mass and the velocity 
p = Mu. 
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The overall expression for momentum in terms of rest mass is: 


P = 


mu 

\J \—u 2 1 c? 


and the components of the momentum are: 


Px = 


mu x 

yj 1 — U 2 / C 2 


Py 


mu y 

\J l—u 2 /c 2 


Pz = 


mu~ 

y/l — U 2 !c 2 


So the components of the momentum depend upon the appropriate velocity component and 
the speed. 


Since the factor with the square root is cumbersome to write, the following abbreviation is 
often used, called the Lorentz gamma factor: 


^ \J l — u 2 / c 2 

The expression for the momentum then reads p = my u. 


It can be seen from the discussion above that we can write the momentum of an object 
moving with velocity u as the product of a function M(u ) of the speed u and the velocity u: 

M(u) u 

The function M(u ) must reduce to the object's mass to at small speeds, in particular when 
the object is at rest M( 0) = m. 

There is a debate about the usage of the term "mass" in relativity theory. If inertial mass 
is defined in terms of momentum then it does indeed vary as M = 7 m for a single particle 
that has rest mass, furthermore, as will be shown below the energy of a particle that has a 
rest mass is given by E = Me 2 . Prior to the debate about nomenclature the function M(u), 
or the relation M = 7 to, used to be called 'relativistic mass', and its value in the frame of 
the particle was referred to as the 'rest mass' or 'invariant mass'. The relativistic mass, 
M = 7 m, would increase with velocity. Both terms are now largely obsolete: the 'rest mass' 
is today simply called the mass, and the 'relativistic mass' is often no longer used since, as 
will be seen in the discussion of energy below, it is identical to the energy but for the units. 


3.14 Force 

Newton's second law states that the total force acting on a particle equals the rate of change 
of its momentum. The same form of Newton's second law holds in relativistic mechanics. 
The relativistic 3 force is given by: 

f = dp/ dt 

If the relativistic mass is used: 

dp d(mu) 

dt dt 

By Leibniz's law where d(xy) = xdy + ydx : 
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. dm 


This equation for force will be used below to derive relativistic expressions for the energy of 
a particle in terms of the old concept of "relativistic mass". 


The relativistic force can also be written in terms of acceleration. Newton's second law can 
be written in the familiar form 


F = ma 

where a = dtv/dt is the acceleration. 

here nr is not the relativistic mass but is the invariant mass. 

In relativistic mechanics, momentum is p = rrr/v 

again nr being the invariant mass and the force is given by F = ^ = m d ^' > 

This form of force is used in the derivation of the expression for energy without relying on 
relativistic mass. 

It will be seen in the second section of this book that Newton's second law in terms of 
acceleration is given by: 

F = m 7 a+^fv 


3.15 Energy 


The debate over the use of the concept "relativistic mass" means that modern physics courses 
may forbid the use of this in the derivation of energy. The newer derivation of energy without 
using relativistic mass is given in the first section and the older derivation using relativistic 
mass is given in the second section. The two derivations can be compared to gain insight 
into the debate about mass but a knowledge of 4 vectors is really required to discuss the 
problem in depth. In principle the first derivation is most mathematically correct because 
"relativistic mass" is given by: M = , m = which involves the constants m and c. 

\Jl — U 2 /c 2 


3.15.1 Derivation of relativistic energy using the relativistic momentum 

In the following, modern derivation, m means the invariant mass - what used to be called 
the "rest mass". Energy is defined as the work done in moving a body from one place to 
another. We will make use of the relativistic momentum p = 'ymv. Energy is given from: 

dE = fr/x 

so, over the whole path: 

E = Jo f dx 

Kinetic energy (K) is the energy used to move a body from a velocity of 0 to a velocity u. 
Restricting the motion to one dimension: 

K=i:^fdx 
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Using the relativistic 3 force: 

K = IS ^dx = = Gmd(ru)% 

substituting for d^u) and using dx/dt = u: 

K = fu=o m ('ydu + ud'y)u 
Which gives: 

K = fu=o m(u'ydu + u 2 d'y) 

The Lorentz factor 7 is given by: 

1 

l—u 2 / c 2 

meaning that : 
d'y = ^7 3 du 
du = Ad~f 

u'y^ 1 

So that 

K = J£?m(u'y£d'y + u*d 1 ) = f^m,(^+u 2 )d'y = f^mcPd'y 
Alternatively, we can use the fact that: 

7 2 c 2 — 7 2 u 2 = c 2 
Differentiating: 

27c 2 d'y — 7 2 2 udu — u 2 ‘l r yd , y = 0 
So, rearranging: 

'yudu + u 2 d'y = c 2 d r y 
In which case: 

K = /“= 0 U ?n(u7d« + M 2 d7) = /“=“ mc 2 d7 

As u goes from 0 to u, the Lorentz factor 7 goes from 1 to 7, so: 

K = me 2 d'y 
and hence: 

K = 'ymc 2 — me 2 

The amount 'ymc 2 is known as the total energy of the particle. The amount me 2 is known 
as the rest energy of the particle. If the total energy of the particle is given the symbol E: 

E = 'ymc 2 = me 2 + K 

So it can be seen that me 2 is the energy of a mass that is stationary. This energy is known 
as mass energy. 

The Newtonian approximation for kinetic energy can be derived by using the binomial 
theorem to expand 7 = (1 — u 2 /c 2 )~ 2 . 
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The binomial expansion is: 

(■ a + x) n = a n + na n ~ 1 x+ n ^7 ^ a n ~ 2 x 2 .... 

So expanding (1 — u 2 /c 2 )~^: 

TS — 1 ™„.2 i 3 mu 4 , 5 mu 6 , 

a -2 mu +^c2- + T6yr + -" 

So if u is much less than c: 

K = \mu 2 

which is the Newtonian approximation for low velocities. 

3.15.2 Derivation of relativistic energy using the concept of relativistic 
mass 

Energy is defined as the work done in moving a body from one place to another. Energy is 
given from: 

dE = Fdx 

so, over the whole path: 

E = Jo Fdx 

Kinetic energy (K) is the energy used to move a body from a velocity of 0 to a velocity u. 
So: 

K = J^Fdx 

Using the relativistic force: 

is — f u=u d(Mu) , 
ly — Ju = 0 dt aX 

So: 

* = s::z d{Mu)f t 

substituting for d(Mu ) and using dx/dt = u: 

K = fyEy ( M du + udM ) u 
Which gives: 

K = fu=o {M udu + u 2 dM) 

The relativistic mass is given by: 



Which can be expanded as: 

M 2 c 2 — M 2 u 2 = m 2 c 2 
Differentiating: 

2 M<?dM - M 2 2udu - u 2 2MdM = 0 
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So, rearranging: 

Mudu + u 2 dM = c 2 dM 
In which case: 

K = (Mudu + u 2 dM ) 

is simplified to: 

K = j:= 0 u c 2 dM 

But the mass goes from m to M so: 

K = c 2 f^=m dM) 
and hence: 

K = Ale 2 — me 2 

The amount Ale 2 is known as the total energy of the particle. The amount me 2 is known 
as the rest energy of the particle. If the total energy of the particle is given the symbol E: 

E = me 2 + K 

So it can be seen that me 2 is the energy of a mass that is stationary. This energy is known as 
mass energy and is the origin of the famous formula E = me 2 that is iconic of the nuclear 
age. 

The Newtonian approximation for kinetic energy can be derived by substituting the rest 
mass for the relativistic mass ie: 


and: 

K = Ale 2 — me 2 


So: 
K = 


-me 2 


yJl — U 2 /c 


ie: 


K = mc 2 ((l — u 2 /c 2 ) 2—1) 

The binomial theorem can be used to expand (1 — u 2 /c 2 )~ 2 : 


The binomial theorem is: 

( a + x) n = a n + na n ~ 1 x + n ^7 — a n ~ 2 x 2 .... 

So expanding (1 — u 2 /c 2 )~^: 

TS — 1™„.2 1 3 mu A 1 5 mu 6 , 

A - 2 mu + -gj2- + yg-r + ••• 

So if u is much less than c: 


K = T^mu 2 

Which is the Newtonian approximation for low velocities. 
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3.16 Nuclear Energy 

When protons and neutrons (nucleons) combine to form elements the combination of particles 
tends to be in a lower energy state than the free neutrons and protons. Iron has the lowest 
energy and elements above and below iron in the scale of atomic masses tend to have higher 
energies. This decrease in energy as neutrons and protons bind together is known as the 
binding energy. The atomic masses of elements are slightly different from that calculated 
from their constituent particles and this difference in mass energy, calculated from E = me 2 , 
is almost exactly equal to the binding energy. 

The binding energy can be released by converting elements with higher masses per nucleon 
to those with lower masses per nucleon. This can be done by either splitting heavy elements 
such as uranium into lighter elements such as barium and krypton or by joining together 
light elements such as hydrogen into heavier elements such as deuterium. If atoms are split 
the process is known as nuclear fission and if atoms are joined the process is known as 
nuclear fusion. Atoms that are lighter than iron can be fused to release energy and those 
heavier than iron can be split to release energy. 

When hydrogen and a neutron are combined to make deuterium the energy released can be 
calculated as follows: 

The mass of a proton is 1.00731 amu, the mass of a neutron is 1.00867 amu and the mass of 
a deuterium nucleus is 2.0136 amu. The difference in mass between a deuterium nucleus 
and its components is 0.00238 amu. The energy of this mass difference is: 

E = me 2 = 1.66 x 10~ 27 x 0.00238 x (3 x 10 8 ) 2 

So the energy released is 3.57 x 10 -13 joules or about 2 x 10 11 joules per gram of protons 
(ionised hydrogen). 

(Assuming 1 amu = 1.66 x 10~ 2 ' Kg, Avogadro's number = 6 x 10 23 and the speed of light 
is 3 x 10 8 metres per second) 

Present day nuclear reactors use a process called nuclear fission in which rods of uranium 
emit neutrons which combine with the uranium in the rod to produce uranium isotopes 
such as 236 U which rapidly decay into smaller nuclei such as Barium and Krypton plus three 
neutrons which can cause further generation of 236 U and further decay. The fact that each 
neutron can cause the generation of three more neutrons means that a self sustaining or 
chain reaction can occur. The generation of energy results from the equivalence of mass 
and energy; the decay products, barium and krypton have a lower mass than the original 
236 U, the missing mass being released as 177 MeV of radiation. The nuclear equation for 
the decay of 236 U is written as follows: 

IfU Ba +|| Kr + 3 n + 177 MeV 


75 



Spacetime 



Figure 43 Nuclear explosion 


If a large amount of the uranium isotope 235 U (the critical mass) is confined the chain 
reaction can get out of control and almost instantly release a large amount of energy. A 
device that confines a critical mass of uranium is known as an atomic bomb or A-bomb. 
A bomb based on the fusion of deuterium atoms is known as a thermonuclear bomb, 
hydrogen bomb or H-bomb. 
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4.1 Introduction 

Many students confuse Relativity Theory with a theory about the propagation of light. 
According to modern Relativity Theory the constancy of the speed of light is a consequence 
of the geometry of spacetime rather than something specifically due to the properties of 
photons; but the statement "the speed of light is constant" often distracts the student into a 
consideration of light propagation. This confusion is amplified by the importance assigned 
to interferometry experiments, such as the Michelson-Morley experiment, in most textbooks 
on Relativity Theory. 

The history of theories of the propagation of light is an interesting topic in physics and was 
indeed important in the early days of Relativity Theory. In the seventeenth century two 
competing theories of light propagation were developed. Christiaan Huygens published a 
wave theory of light which was based on Huygen's principle whereby every point in a 
wavelike disturbance can give rise to further disturbances that spread out spherically. In 
contrast Newton considered that the propagation of light was due to the passage of small 
particles or "corpuscles" from the source to the illuminated object. His theory is known as the 
corpuscular theory of light. Newton's theory was widely accepted until the nineteenth 
century. 

In the early nineteenth century Thomas Young performed his Young's slits experiment 
and the interference pattern that occurred was explained in terms of diffraction due to the 
wave nature of light. The wave theory was accepted generally until the twentieth century 
when quantum theory confirmed that light had a corpuscular nature and that Huygen's 
principle could not be applied. 

The idea of light as a disturbance of some medium, or aether, that permeates the universe 
was problematical from its inception (US spelling: "ether"). The first problem that arose was 
that the speed of light did not change with the velocity of the observer. If light were indeed 
a disturbance of some stationary medium then as the earth moves through the medium 
towards a light source the speed of light should appear to increase. It was found however 
that the speed of light did not change as expected. Each experiment on the velocity of light 
required corrections to existing theory and led to a variety of subsidiary theories such as the 
"aether drag hypothesis". Ultimately it was experiments that were designed to investigate 
the properties of the aether that provided the first experimental evidence for Relativity 
Theory. 
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4.2 The aether drag hypothesis 

The aether drag hypothesis was an early attempt to explain the way experiments such as 
Arago's experiment showed that the speed of light is constant. The aether drag hypothesis 
is now considered to be incorrect. 

According to the aether drag hypothesis light propagates in a special medium, the aether, 
that remains attached to things as they move. If this is the case then, no matter how fast 
the earth moves around the sun or rotates on its axis, light on the surface of the earth would 
travel at a constant velocity. 



V V 


Figure 44 Stellar Aberration. If a telescope is travelling at high speed, only light 
incident at a particular angle can avoid hitting the walls of the telescope tube. 
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The primary reason the aether drag hypothesis is considered invalid is because of the 
occurrence of stellar aberration. In stellar aberration the position of a star when viewed 
with a telescope swings each side of a central position by about 20.5 seconds of arc every six 
months. This amount of swing is the amount expected when considering the speed of earth's 
travel in its orbit. In 1871, George Biddell Airy 1 demonstrated that stellar aberration occurs 
even when a telescope is filled with water. It seems that if the aether drag hypothesis were 
true then stellar aberration would not occur because the light would be travelling in the 
aether which would be moving along with the telescope. 


A 

f] 






Figure 45 The "train analogy" for the absence of aether drag. 


1 http : //en . wikibooks . org/wiki/"/,3Aw"/,3AGeorge"/,20BiddelT/,20Airy 
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If you visualize a bucket on a train about to enter a tunnel and a drop of water drips from 
the tunnel entrance into the bucket at the very centre, the drop will not hit the centre at 
the bottom of the bucket. The bucket is the tube of a telescope, the drop is a photon and 
the train is the earth. If aether is dragged then the droplet would be travelling with the 
train when it is dropped and would hit the centre of bucket at the bottom. 

The amount of stellar aberration, a is given by: 


tan(a) = v6t/cSt 


So: 


tan(a) = v/c 

The speed at which the earth goes round the sun, v = 30 krn/s, and the speed of light is c 
= 300,000,000 m/s which gives a = 20.5 seconds of arc every six months. This amount of 
aberration is observed and this contradicts the aether drag hypothesis. 

In 1818, Augustin Jean Fresnel 2 introduced a modification to the aether drag hypothesis 
that only applies to the interface between media. This was accepted during much of the 
nineteenth century but has now been replaced by special theory of relativity (see below). 

The aether drag hypothesis is historically important because it was one of the reasons why 
Newton's corpuscular theory of light was replaced by the wave theory and it is used in early 
explanations of light propagation without relativity theory. It originated as a result of early 
attempts to measure the speed of light. 

In 1810, Frangois Arago 3 realised that variations in the refractive index of a substance 
predicted by the corpuscular theory would provide a useful method for measuring the velocity 
of light. These predictions arose because the refractive index of a substance such as glass 
depends on the ratio of the velocities of light in air and in the glass. Arago attempted to 
measure the extent to which corpuscles of light would be refracted by a glass prism at the 
front of a telescope. He expected that there would be a range of different angles of refraction 
due to the variety of different velocities of the stars and the motion of the earth at different 
times of the day and year. Contrary to this expectation he found that that there was no 
difference in refraction between stars, between times of day or between seasons. All Arago 
observed was ordinary stellar aberration. 

In 1818 Fresnel examined Arago's results using a wave theory of light. He realised that 
even if light were transmitted as waves the refractive index of the glass-air interface should 
have varied as the glass moved through the aether to strike the incoming waves at different 
velocities when the earth rotated and the seasons changed. 

Fresnel proposed that the glass prism would carry some of the aether along with it so that 
"...the aether is in excess inside the prism". He realised that the velocity of propagation of 
waves depends on the density of the medium so proposed that the velocity of light in the 
prism would need to be adjusted by an amount of 'drag'. 


2 http : //en . wikibooks . org/wiki/y o 3Awy o 3AAugustiiiyo20Jean'/o20Fresnel 

3 http : //en. wikibooks . org/wiki/y o 3Awyo3AFranyoE7oisy o 20Arago 
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The velocity of light v n in the glass without any adjustment is given by: 


v n = c/n 


The drag adjustment v d is given by: 


v d = v(l ) 

Pa 

Where p e is the aether density in the environment, p g is the aether density in the glass and 
v is the velocity of the prism with respect to the aether. 

The factor (1 — y-) can be written as (1 — 4*) because the refractive index, n, would be 
dependent on the density of the aether. This is known as the Fresnel drag coefficient. 

The velocity of light in the glass is then given by: 


V=- + v(l 
n 


n * 


) 


This correction was successful in explaining the null result of Arago's experiment. It 
introduces the concept of a largely stationary aether that is dragged by substances such 
as glass but not by air. Its success favoured the wave theory of light over the previous 
corpuscular theory. 

The Fresnel drag coefficient was confirmed by an interferometer experiment performed by 
Fizeau. Water was passed at high speed along two glass tubes that formed the optical paths 
of the interferometer and it was found that the fringe shifts were as predicted by the drag 
coefficient. 
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The Fizeau Experiment 
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Figure 46 


The special theory of relativity predicts the result of the Fizeau experiment from the velocity 
addition theorem without any need for an aether. 

If V is the velocity of light relative to the Fizeau apparatus and U is the velocity of light 
relative to the water and v is the velocity of the water: 


U = 


V = 


n 

c/n + v 


l + v/nc 

which, if v/c is small can be expanded using the binomial expansion to become: 


V=- + V (l-^) 

n n z 


This is identical to Fresnel's equation. 
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It may appear as if Fresnel's analysis can be substituted for the relativistic approach, however, 
more recent work has shown that Fresnel's assumptions should lead to different amounts of 
aether drag for different frequencies of light and violate Snell's law (see Ferraro and Sforza 
(2005)). 

The aether drag hypothesis was one of the arguments used in an attempt to explain the 
Michelson-Morley experiment before the widespread acceptance of the special theory of 
relativity. 

The Fizeau experiment is consistent with relativity and approximately consistent with each 
individual body, such as prisms, lenses etc. dragging its own aether with it. This contradicts 
some modified versions of the aether drag hypothesis that argue that aether drag may 
happen on a global (or larger) scale and stellar aberration is merely transferred into the 
entrained "bubble" around the earth which then faithfully carries the modified angle of 
incidence directly to the observer. 

References 

• Rafael Ferraro and Daniel M Sforza 2005. Arago (1810): the first experimental result 
against the ether Eur. J. Phys. 26 195-204 4 


4.3 The Michelson-Morley experiment 

The Michelson-Morley experiment, one of the most important and famous experiments 
in the history of physics, was performed in 1887 by Albert Michelson and Edward Morley 
at what is now Case Western Reserve University, and is considered to be the first strong 
evidence against the theory of a luminiferous aether. 

Physics theories of the late 19th century postulated that, just as water waves must have a 
medium to move across (water), and audible sound waves require a medium to move through 
(air), so also light waves require a medium, the "luminiferous aether". The speed of light 
being so great, designing an experiment to detect the presence and properties of this aether 
took considerable thought. 


4 http : / / arXiv. org/pdf /physics/0412055 
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4.3.1 Measuring aether 
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Figure 47 


A depiction of the concept of the “aether wind”. Each year, the Earth travels a tremendous 
distance in its orbit around the sun, at a speed of around 30 km/second, over 100,000 km 
per hour. It was reasoned that the Earth would at all times be moving through the aether 
and producing a detectable "aether wind". At any given point on the Earth's surface, the 
magnitude and direction of the wind would vary with time of day and season. By analysing 
the effective wind at various different times, it should be possible to separate out components 
due to motion of the Earth relative to the Solar System from any due to the overall motion 
of that system. 

The effect of the aether wind on light waves would be like the effect of wind on sound waves. 
Sound waves travel at a constant speed relative to the medium that they are travelling 
through (this varies depending on the pressure, temperature etc (see sound), but is typically 
around 340 m/s). So, if the speed of sound in our conditions is 340 m/s, when there is a 10 
m/s wind relative to the ground, into the wind it will appear that sound is travelling at 330 
m/s (340 - 10). Downwind, it will appear that sound is travelling at 350 m/s (340 + 10). 
Measuring the speed of sound compared to the ground in different directions will therefore 
enable us to calculate the speed of the air relative to the ground. 

If the speed of the sound cannot be directly measured, an alternative method is to measure 
the time that the sound takes to bounce off of a reflector and return to the origin. This is 
done parallel to the wind and perpendicular (since the direction of the wind is unknown 
before hand, just determine the time for several different directions). The cumulative round 
trip effects of the wind in the two orientations slightly favors the sound travelling at right 
angles to it. Similarly, the effect of an aether wind on a beam of light would be for the beam 
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to take slightly longer to travel round-trip in the direction parallel to the “wind” than to 
travel the same round-trip distance at right angles to it. 

“Slightly” is key, in that, over a distance such as a few meters, the difference in time for 
the two round trips would be only about a millionth of a millionth of a second. At this 
point the only truly accurate measurements of the speed of light were those carried out by 
Albert Abraham Michelson, which had resulted in measurements accurate to a few meters 
per second. While a stunning achievement in its own right, this was certainly not nearly 
enough accuracy to be able to detect the aether. 


4.3.2 The experiments 

Michelson, though, had already seen a solution to this problem. His design, later known as 
an interferometer, sent a single source of white light through a half-silvered mirror that was 
used to split it into two beams travelling at right angles to one another. After leaving the 
splitter, the beams travelled out to the ends of long arms where they were reflected back 
into the middle on small mirrors. They then recombined on the far side of the splitter in 
an eyepiece, producing a pattern of constructive and destructive interference based on the 
length of the arms. Any slight change in the amount of time the beams spent in transit 
would then be observed as a shift in the positions of the interference fringes. If the aether 
were stationary relative to the sun, then the Earth's motion would produce a shift of about 
0.04 fringes. 

Michelson had made several measurements with an experimental device in 1881, in which 
he noticed that the expected shift of 0.04 was not seen, and a smaller shift of about 0.02 
was. However his apparatus was a prototype, and had experimental errors far too large to 
say anything about the aether wind. For a measurement of the aether wind, a much more 
accurate and tightly controlled experiment would have to be carried out. The prototype 
was, however, successful in demonstrating that the basic method was feasible. 
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Mirror 



He then combined forces with Edward Morley and spent a considerable amount of time and 
money creating an improved version with more than enough accuracy to detect the drift. In 
their experiment the light was repeatedly reflected back and forth along the arms, increasing 
the path length to 11m. At this length the drift would be about .4 fringes. To make that 
easily detectable the apparatus was located in a closed room in the basement of a stone 
building, eliminating most thermal and vibrational effects. Vibrations were further reduced 
by building the apparatus on top of a huge block of marble, which was then floated in a pool 
of mercury. They calculated that effects of about l/100th of a fringe would be detectable. 

The mercury pool allowed the device to be turned, so that it could be rotated through the 
entire range of possible angles to the "aether wind". Even over a short period of time some 
sort of effect would be noticed simply by rotating the device, such that one arm rotated 
into the direction of the wind and the other away. Over longer periods day /night cycles or 
yearly cycles would also be easily measurable. 

During each full rotation of the device, each arm would be parallel to the wind twice (facing 
into and away from the wind) and perpendicular to the wind twice. This effect would show 
readings in a sine wave formation with two peaks and two troughs. Additionally if the wind 
was only from the earth's orbit around the sun, the wind would fully change directions 
east/west during a 12 hour period. In this ideal conceptualization, the sine wave of day/night 
readings would be in opposite phase. 

Because it was assumed that the motion of the solar system would cause an additional 
component to the wind, the yearly cycles would be detectable as an alteration of the 
maginitude of the wind. An example of this effect is a helicopter flying forward. While on 
the ground, a helicopter's blades would be measured as travelling around at 50 km/h at the 
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tips. However, if the helicopter is travelling forward at 50 km/h, there are points at which 
the tips of the blades are travelling 0 km/h and 100 km/h with respect to the air they are 
travelling through. This increases the magnitude of the lift on one side and decreases it on 
the other just as it would increase and decrease the magnitude of an ether wind on a yearly 
basis. 


4.3.3 The most famous failed experiment 

Ironically, after all this thought and preparation, the experiment became what might be 
called the most famous failed experiment to date. Instead of providing insight into the 
properties of the aether, Michelson and Morley's 1887 article 5 in the American Journal of 
Science reported the measurement to be as small as one-fortieth of the expected displacement 
but "since the displacement is proportional to the square of the velocity" they concluded 
that the measured velocity was approximately one-sixth of the expected velocity of the 
Earth's motion in orbit and "certainly less than one-fourth". Although this small "velocity" 
was measured, it was considered far too small to be used as evidence of aether, it was later 
said to be within the range of an experimental error that would allow the speed to actually 
be zero. 

Although Michelson and Morley went on to different experiments after their first publication 
in 1887, both remained active in the field. Other versions of the experiment were carried out 
with increasing sophistication. Kennedy and Illingsworth both modified the mirrors to include 
a half-wave "step", eliminating the possibility of some sort of standing wave pattern within 
the apparatus. Illingsworth could detect changes on the order of l/300th of a fringe, Kennedy 
up to l/1500th. Miller later built a non- magnetic device to eliminate magnetostriction, 
while Michelson built one of non-expanding invar to eliminate any remaining thermal effects. 
Others from around the world increased accuracy, eliminated possible side effects, or both. 
All of these with the exception of Dayton Miller also returned what is considered a null 
result. 

Morley was not convinced of his own results, and went on to conduct additional experiments 
with Dayton Miller. Miller worked on increasingly large experiments, culminating in one 
with a 32m (effective) arm length at an installation at the Mount Wilson observatory. To 
avoid the possibility of the aether wind being blocked by solid walls, he used a special shed 
with thin walls, mainly of canvas. He consistently measured a small positive effect that 
varied, as expected, with each rotation of the device, the sidereal day and on a yearly basis. 
The low magnitude of the results he attributed to aether entrainment (see below). His 
measurements amounted to only '10 kps instead of the expected '30 kps expected from the 
earth's orbital motion alone. He remained convinced this was due to partial entrainment, 
though he did not attempt a detailed explanation. 

Though Kennedy later also carried out an experiment at Mount Wilson, finding 1/10 the 
drift measured by Miller, and no seasonal effects, Miller's findings were considered important 
at the time, and were discussed by Michelson, Hendrik Lorentz and others at a meeting 
reported in 1928 (ref below). There was general agreement that more experimentation was 
needed to check Miller's results. Lorentz recognised that the results, whatever their cause, 


5 http : //www . anti-relativity . com/MM_Paper .pdf 


87 



aether 


did not quite tally with either his or Einstein's versions of special relativity. Einstein was 
not present at the meeting and felt the results could be dismissed as experimental error (see 
Shankland ref below) . 
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In recent times versions of the MM experiment have become commonplace. Lasers and 
masers amplify light by repeatedly bouncing it back and forth inside a carefully tuned cavity, 
thereby inducing high-energy atoms in the cavity to give off more light. The result is an 
effective path length of kilometers. Better yet, the light emitted in one cavity can be used 
to start the same cascade in another set at right angles, thereby creating an interferometer 
of extreme accuracy. 

The first such experiment was led by Charles H. Townes, one of the co-creators of the 
first maser. Their 1958 experiment put an upper limit on drift, including any possible 
experimental errors, of only 30 m/s. In 1974 a repeat with accurate lasers in the triangular 
Trimmer experiment reduced this to 0.025 m/s, and included tests of entrainment by placing 
one leg in glass. In 1979 the Brillet-Hall experiment put an upper limit of 30 m/s for any 
one direction, but reduced this to only 0.000001 m/s for a two-direction case (ie, still or 
partially entrained aether). A year long repeat known as Hils and Hall, published in 1990, 
reduced this to 2xl0" 13 . 

4.3.4 Fallout 

This result was rather astounding and not explainable by the then-current theory of wave 
propagation in a static aether. Several explanations were attempted, among them, that the 
experiment had a hidden flaw (apparently Michelson's initial belief), or that the Earth's 
gravitational field somehow "dragged" the aether around with it in such a way as locally 
to eliminate its effect. Miller would have argued that, in most if not all experiments other 
than his own, there was little possibility of detecting an aether wind since it was almost 
completely blocked out by the laboratory walls or by the apparatus itself. Be this as it may, 
the idea of a simple aether, what became known as the First Postulate, had been dealt a 
serious blow. 

A number of experiments were carried out to investigate the concept of aether dragging, or 
entrainment. The most convincing was carried out by Hamar, who placed one arm of the 
interferometer between two huge lead blocks. If aether were dragged by mass, the blocks 
would, it was theorised, have been enough to cause a visible effect. Once again, no effect 
was seen. 

Walter Ritz's Emission theory (or ballistic theory), was also consistent with the results of 
the experiment, not requiring aether, more intuitive and paradox-free. This became known 
as the Second Postulate. However it also led to several "obvious" optical effects that were 
not seen in astronomical photographs, notably in observations of binary stars in which the 
light from the two stars could be measured in an interferometer. 

The Sagnac experiment placed the MM apparatus on a constantly rotating turntable. In 
doing so any ballistic theories such as Ritz's could be tested directly, as the light going one 
way around the device would have different length to travel than light going the other way 
(the eyepiece and mirrors would be moving toward/away from the light). In Ritz's theory 
there would be no shift, because the net velocity between the light source and detector was 
zero (they were both mounted on the turntable). However in this case an effect was seen, 
thereby eliminating any simple ballistic theory. This fringe-shift effect is used today in laser 
gyroscopes. 
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Another possible solution was found in the Lorentz-Fitz Gerald contraction hypothesis. In 
this theory all objects physically contract along the line of motion relative to the aether, so 
while the light may indeed transit slower on that arm, it also ends up travelling a shorter 
distance that exactly cancels out the drift. 

In 1932 the Kennedy-Thorndike experiment modified the Michelson-Morley experiment by 
making the path lengths of the split beam unequal, with one arm being very long. In this 
version the two ends of the experiment were at different velocities due to the rotation of the 
earth, so the contraction would not "work out" to exactly cancel the result. Once again, no 
effect was seen. 

Ernst Mach was among the first physicists to suggest that the experiment actually amounted 
to a disproof of the aether theory. The development of what became Einstein's special theory 
of relativity had the Fitzgerald-Lorentz contraction derived from the invariance postulate, 
and was also consistent with the apparently null results of most experiments (though not, as 
was recognised at the 1928 meeting, with Miller's observed seasonal effects). Today relativity 
is generally considered the "solution" to the MM null result. 

The Trouton-Noble experiment is regarded as the electrostatic equivalent of the Michelson- 
Morley optical experiment, though whether or not it can ever be done with the necessary 
sensitivity is debatable. On the other hand, the 1908 Trouton-Rankine experiment that 
spelled the end of the Lorentz-FitzGerald contraction hypothesis achieved an incredible 
sensitivity. 
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4.4 Mathematical analysis of the Michelson Morley 
Experiment 


The Michelson interferometer splits light into rays that travel along two paths then recombines 
them. The recombined rays interfere with each other. If the path length changes in one of 
the arms the interference pattern will shift slightly, moving relative to the cross hairs in 


6 http : //www .datasync . com/~rsf 1/ crit/ 1908a.htm 

7 http : //www .datasync . com/~rsf 1/desit- le .htm 

8 http : //www . wbabin .net/ sf arti/ sf art i 12 .pdf 
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the telescope. The Michelson interferometer is arranged as an optical bench on a concrete 
block that floats on a large pool of mercury. This allows the whole apparatus to be rotated 
smoothly. 

If the earth were moving through an aether at the same velocity as it orbits the sun (30 
km/sec) then Michelson and Morley calculated that a rotation of the apparatus should cause 
a shift in the fringe pattern. The basis of this calculation is given below. 


A Michelson Interferometer 


Mirror 2 Mirror 2 




n w * 

The path of the beam on the supposition 
that the apparatus is moving at velocity v 
jjl with respect to an aether. 


Figure 49 


Consider the time taken t\ for light to travel along Path 1 in the illustration: 


t\ = — + ^- 

c — v c + v 


Rearranging terms: 


_AL + JtL = 2L f c 

c — v c-\-v c 2 — V 2 


further rearranging: 

2 LjC 2Z/j \ 

c 2 —v 2 c X—v 2 /c 2 

hence: 


ti 


2 Lf i 

C 1 — V 2 / c 2 


Considering Path 2, the light traces out two right angled triangles so: 


ct 2 = 2y / T 2 n + (uf 2 /2) 2 
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Rearranging: 


h 


2 L. n 




So: 


, m 

c a/i - [y/c) 2 

It is now easy to calculate the difference (At between the times spent by the light in Path 1 
and Path 2: 


A t = - 


Lr 


L 


f 


C Y a/1 — v 2 /c 2 l — v 2 /c 2 J 
If the apparatus is rotated by 90 degrees the new time difference is: 


At 




-v 2 /c 2 a/ 1-v' 2 /c 


The interference fringes due to the time difference between the paths will be different after 
rotation if At and At are different. 


At 
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"V 1 
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This difference between the two times can be calculated if the binomial expansions of 1 _ t) 1 2 y c2 

and , 1 „ are used: 
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So: 
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7 


If the period of one vibration of the light is T then the number of fringes (n) , that will move 
past the cross hairs of the telescope when the apparatus is rotated will be: 


n = 


A t'- At 
T ~ 
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Inserting the formula for At' At 


n : 


L f + L m v 2 


cT c 2 

But cT for a light wave is the wavelength of the light ie: cT = A so: 


n : 


If the wavelength of the light is 5 x 10 7 and the total path length is 20 metres then: 


Lf + L m v 2 
A c 2 


n = 


20 

5 x 10- 7 


10" 8 


So the fringes will shift by 0.4 fringes (ie: 40%) when the apparatus is rotated. 

However, no fringe shift is observed. The null result of the Michelson-Morley experiment is 
nowdays explained in terms of the constancy of the speed of light. The assumption that 
the light would have a velocity of c — v and c + v depending on the direction relative to the 
hypothetical "aether wind" is false, the light always travels at c between two points in a 
vacuum and the speed of light is not affected by any "aether wind". This is because, in 
{special relativity} the Lorentz transforms induce a {length contraction}. Doing over the 
above calculations we obtain: 


L f = L m \j\ — v 2 /c 2 

(taking into consideration the length contraction) 

It is now easy to recalculate the difference (At between the times spent by the light in Path 
1 and Path 2: 

At = - ( 7 Lm — , L L ,, ) = 0 because Lt = L m J\ — v 2 /c 2 

c y^/i- v 2/ c 2 i-v 2 /c- J i m v / 

If the apparatus is rotated by 90 degrees the new time difference is: 

/ 2 f L m P/ \ q 

c V pi -v‘ 2 /c 2 J 

The interference fringes due to the time difference between the paths will be different after 
rotation if At and At are different. 



Lm + L j 
1 — v^/c^ 


L f+ L m \_ n 

V I — 1'2/ C 2 J 
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4.4.1 Coherence length 

The coherence length of light rays from a source that has wavelengths that differ by AA is: 

A 2 

''' “ 2ttAA 

If path lengths differ by more than this amount then interference fringes will not be observed. 
White light has a wide range of wavelengths and interferometers using white light must have 
paths that are equal to within a small fraction of a millimetre for interference to occur. This 
means that the ideal light source for a Michelson Interferometer should be monochromatic 
and the arms should be as near as possible equal in length. 

The calculation of the coherence length is based on the fact that interference fringes become 
unclear when light rays are about 60 degrees (about 1 radian or one sixth of a wavelength 
(~ 1/27 t)) out of phase. This means that when two beams are: 

_A_ 

27T 

metres out of step they will no longer give a well defined interference pattern. Suppose a 
light beam contains two wavelengths of light, A and A + AA, then in: 

A 

2ttAA 


cycles they will be ^ out of phase. 

The distance required for the two different wavelengths of light to be this much out of phase 
is the coherence length. Coherence length = number of cycles x length of each cycle so: 

coherence length = 2 n&.\ • 


4.5 Lorentz-Fitzgerald Contraction Hypothesis 

After the first Michelson- Mor ley experiments in 1881 there were several attempts to explain 
the null result. The most obvious point of attack is to propose that the path that is parallel 
to the direction of motion is contracted by i/l — v 2 /c 2 in which case At and At would be 
identical and no fringe shift would occur. This possibility was proposed in 1892 by Fitzgerald. 
Lorentz produced an "electron theory of matter" that would account for such a contraction. 

Students sometimes make the mistake of assuming that the Lorentz-Fitzgerald contraction 
is equivalent to the Lorentz transformations. However, in the absence of any treatment of 
the time dilation effect the Lorentz-Fitgerald explanation would result in a fringe shift if the 
apparatus is moved between two different velocities. The rotation of the earth allows this 
effect to be tested as the earth orbits the sun. Kennedy and Thorndike (1932) performed 
the Michelson-Morley experiment with a highly sensitive apparatus that could detect any 
effect due to the rotation of the earth; they found no effect. They concluded that both time 
dilation and Lorentz-Fitzgerald Contraction take place, thus confirming relativity theory. 
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If only the Lor entz- Fitzgerald contraction applied then the fringe shifts due to changes 
in velocity would be: n = (v\ — v 2 ) /c 2 x (Lf — L m )/\. Notice how the sensitivity of the 
experiment is dependent on the difference in path length Lf — L m and hence a long coherence 
length is required. 
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• Early Experiments 12 
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5.1 Vectors 

Physical effects involve things acting on other things to produce a change of position, tension 
etc. These effects usually depend upon the strength, angle of contact, separation etc of the 
interacting things rather than on any absolute reference frame so it is useful to describe 
the rules that govern the interactions in terms of the relative positions and lengths of the 
interacting things rather than in terms of any fixed viewpoint or coordinate system. Vectors 
were introduced in physics to allow such relative descriptions. 

The use of vectors in elementary physics often avoids any real understanding of what they 
are. They are a new concept, as unique as numbers themselves, which have been related to 
the rest of mathematics and geometry by a series of formulae such as linear combinations, 
scalar products etc. 

Vectors are defined as "directed line segments" which means they are lines drawn in a 
particular direction. The introduction of time as a geometric entity means that this definition 
of a vector is rather archaic, a better definition might be that a vector is information arranged 
as a continuous succession of points in space and time. Vectors have length and direction, 
the direction being from earlier to later. 

Vectors are represented by lines terminated with arrow symbols to show the direction. A 
point that moves from the left to the right for about three centimetres can be represented as: 


> 

Symbol for a vector 


Figure 50 
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If a vector is represented within a coordinate system it has components along each of the 
axes of the system. These components do not normally start at the origin of the coordinate 
system. 


z 



Figure 51 


The vector represented by the bold arrow has components a, b and c which are lengths on 
the coordinate axes. If the vector starts at the origin the components become simply the 
coordinates of the end point of the vector and the vector is known as the position vector of 
the end point. 

5.1.1 Addition of Vectors 

If two vectors are connected so that the end point of one is the start of the next the sum of 
the two vectors is defined as a third vector drawn from the start of the first to the end of 
the second: 
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C 


Figure 52 

c is the sum of a and b: 
c = a + b 

If a components of a are a, b, c and the components of b are d, e, f then the components 
of the sum of the two vectors are (a+d), (b+e) and (c+f). In other words, when vectors 
are added it is the components that add numerically rather than the lengths of the vectors 
themselves. 

Rules of Vector Addition 

1. Commutativity a + b = b + a 

2. Associativity (a + b) + c = a + (b + c) 

If the zero vector (which has no length) is labelled as 0 

3. a + (-a) = 0 

4. a + 0 = a 


5.1.2 Rules of Vector Multiplication by a Scalar 

The discussion of components and vector addition shows that if vector a has components 
a,b,c then qa has components qa, qb, qc. The meaning of vector multiplication is shown 
below: 
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Figure 53 

The bottom vector c is added three times which is equivalent to multiplying it by 3. 

1. Distributive laws q(a + b) = qa + qb and (q + p)a = qa + pa 

2. Associativity q(pa) = qpa 
Also 1 a = a 

If the rules of vector addition and multiplication by a scalar apply to a set of elements they 
are said to define a vector space. 

5.1.3 Linear Combinations and Linear Dependence 

An element of the form: 

giai + q , 2 a 2 + (/3 a 3 + •••• + Qm a m 

is called a linear combination of the vectors. 

The set of vectors multiplied by scalars in a linear combination is called the span of the 
vectors. The word span is used because the scalars (q) can have any value - which means 
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that any point in the subset of the vector space defined by the span can contain a vector 
derived from it. 

Suppose there were a set of vectors ( 01 , 02 , ....,a m ) , if it is possible to express one of these 
vectors in terms of the others, using any linear combination, then the set is said to be 
linearly dependent. If it is not possible to express any one of the vectors in terms of the 
others, using any linear combination, it is said to be linearly independent. 

In other words, if there are values of the scalars such that: 

(1). ai = Q2 a 2 + 93 a 3 + •••• + <?m a m 

the set is said to be linearly dependent. 

There is a way of determining linear dependence. From (1) it can be seen that if q\ is set to 
minus one then: 

<?l a l + ?2 a 2 + (/3 a 3 + •••• + <7m a m = 0 

So in general, if a linear combination can be written that sums to a zero vector then the set 
of vectors (ai,a 2 , ....,a m ) are not linearly independent. 

If two vectors are linearly dependent then they lie along the same line (wherever a and b lie 
on the line, scalars can be found to produce a linear combination which is a zero vector). If 
three vectors are linearly dependent they lie on the same line or on a plane (collinear or 
coplanar) . 

5.1.4 Dimension 

If n+ 1 vectors in a vector space are linearly dependent then n vectors are linearly independent 
and the space is said to have a dimension of n. The set of n vectors is said to be the basis of 
the vector space. 

5.1.5 Scalar Product 

Also known as the 'dot product' or 'inner product'. The scalar product is a way of removing 
the problem of angular measures from the relationship between vectors and, as Weyl put it, 
a way of comparing the lengths of vectors that are arbitrarily inclined to each other. 

Consider two vectors with a common origin: 
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P 


Figure 54 

The projection of a on the adjacent side is: 

P = |a| cosO 

Where |a| is the length of a. 

The scalar product is defined as: 

(2) a.b = laHblcos# 

Notice that cosO is zero if a and b are perpendicular. This means that if the scalar product 
is zero the vectors composing it are orthogonal (perpendicular to each other). 

(2) also allows cosO to be defined as: 
cosO = a.b/(|a||b|) 

The definition of the scalar product also allows a definition of the length of a vector in terms 
of the concept of a vector itself. The scalar product of a vector with itself is: 

a. a = |a| |a|cosO 

cos 0 (the cosine of zero) is one so: 
a.a = a 2 

which is our first direct relationship between vectors and scalars. This can be expressed as: 

(3) a = \faP& 
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where a is the length of a. 

Properties: 

1 . Linearity [G& + Hb].c = Ga.c + iLb.c 

2. symmetry a.b = b.a 

3 . Positive definiteness a. a is greater than or equal to 0 

4 . Distributivity for vector addition (a + b).c = a.c + b.c 

5 . Schwarz inequality | a.b | < ab 

6. Parallelogram equality |a + b| 2 + |a — b| 2 = 2(|a| 2 + |b| 2 ) 

From the point of view of vector physics the most important property of the scalar product 
is the expression of the scalar product in terms of coordinates. 

7 . a.b = 0161 + 0262 + a 3 6 3 

This gives us the length of a vector in terms of coordinates (Pythagoras' theorem) from: 

8. a. a = a 2 = a\ + a| + a| 

The derivation of 7 is: 

a = aii + 02 j + a 3 k 

where i,j,k are unit vectors along the coordinate axes. From ( 4 ) 
a.b = (aii + a 2 j +a 3 k).b = aii.b + o 2 j.b + a 3 k.b 
but b = 6ii + 6 2 j + 6 3 k 
so: 

a.b = 6iaii.i + 6 2 oii.j + 6 3 aii.k + 6ia 2 j.i + 62 02j.j + 6 3 a2j.k + 6ia 3 k.i + 6 2 a 3 k.j + 6 3 a 3 k.k 

i.j,i.k,j.k, etc. are all zero because the vectors are orthogonal, also i.i, j .j and k.k are all 
one (these are unit vectors defined to be 1 unit in length). 

Using these results: 

a.b = a 3 6i + a 2 & 2 + a 3 & 3 

5.2 Matrices 

Matrices are sets of numbers arranged in a rectangular array. They are especially important 
in linear algebra because they can be used to represent the elements of linear equations. 

11a + 2b = c 

5 a + 7 b = d 

The constants in the equation above can be represented as a matrix: 
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A = 


11 

5 


2 

7 


The elements of matrices are usually denoted symbolically using lower case letters: 


A = 


an 

021 


012 

022 


Matrices are said to be equal if all of the corresponding elements are equal. 
Eg: if a,ij = b^ 

Then A = B 


5.2.1 Matrix Addition 


Matrices are added by adding the individual elements of one matrix to the corresponding 
elements of the other matrix. 

<Hj — Clij T bij 

or C = A + B 

Matrix addition has the following properties: 

1 . Commutativity A + B = B + A 

2. Associativity (A + B) + C = A + (B + C) 
and 

3. A + (— A) = 0 

4. A + 0 = A 

From matrix addition it can be seen that the product of a matrix A and a number p is 
simply pA where every element of the matrix is multiplied individually by p. 

Transpose of a Matrix 

A matrix is transposed when the rows and columns are interchanged: 


A = 


an 

«21 

031 


ai2 

013 

022 

023 

032 

033 


a t 


Oil 

021 

031 

012 

022 

032 

013 

023 

033 
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Notice that the principal diagonal elements stay the same after transposition. 

A matrix is symmetric if it is equal to its transpose eg: akj = ajk- 

It is skew symmetric if A T = —A eg: akj = — ajk ■ The principal diagonal of a skew symmetric 
matrix is composed of elements that are zero. 

Other Types of Matrix 

Diagonal matrix: all elements above and below the principal diagonal are zero. 

4 0 0" 

0-10 
0 0 2 

Unit matrix: denoted by I, is a diagonal matrix where all elements of the principal diagonal 
are 1. 

'l 0 o ' 

0 1 0 
0 0 1 


5.2.2 Matrix Multiplication 

Matrix multiplication is defined in terms of the problem of determining the coefficients in 
linear transformations. 

Consider a set of linear transformations between 2 coordinate systems that share a common 
origin and are related to each other by a rotation of the coordinate axes. 

Two Coordinate Systems Rotated Relative to Each Other 

If there are 3 coordinate systems, x, y, and z these can be transformed from one to another: 

xi = auyi + a\2D2 

x-2 = a-2iyi + (1221/2 

2/1 = buzi+b 12 z 2 

2/2 = & 21^1 + & 22^2 

X \ = CnZl + Cl 2 Z 2 

X 2 = C 21 z 1 +c 22 z 2 

By substitution: 

X\ = an(biizi + bi 2 z 2 ) + ai 2 (b 2 izi + b 22 z 2 ) 
x 2 = a 21 (b n zi + b 12 z 2 ) + a 22 (b 21 z 1 +b 22 z 2 ) 
xi = (an&n + ai 2 (b 2 i)z\ + (an&i2 + a\ 2 b 22 )z 2 
x 2 = (a 21 b u +a 22 (b 2 i)z 1 + (a 2 ibi 2 + a 22 b 22 )z 2 
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Therefore: 

cn = (aii&n + 012(621) 

C12 = (011612 + 012622) 

C21 = (021611 + 022621) 

C22 = (021612 + 022622) 

The coefficient matrices are: 


A = 

an 

012 


021 

a-22 

B = 

611 

612 

621 

622 

C = 

Cll 

C12 


C21 

C22 


From the linear transformation the product of A and B is defined as: 


C = AB = 


(011611 + 012621) (011612 + 012622) 

(021611 + 022621) (021612 + 022622) 


In the discussion of scalar products it was shown that, for a plane the scalar product is 
calculated as: a.b = 0161 + 0262 where a and b are the coordinates of the vectors a and b. 


Now mathematicians define the rows and columns of a matrix as vectors: 
A Column vector is b = 


611 

621 


And a Row vector a = 


Oil 012 


Matrices can be described as vectors eg: 


A = 

an 

dl2 



ai 


<321 

(122 


a 2 


and 


B = 


611 612 

621 622 


bib 2 


Matrix multiplication is then defined as the scalar products of the vectors so that: 


C 


ai.bi ai.b 2 
a 2 .bi a 2 .b 2 
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From the definition of the scalar product ai.bi = a\\b\\ +012621 etc. 
In the general case: 



ai.bi 

ai.b 2 

. ai.b n 

a 2 .bi 

a 2 .b 2 

. a 2 .b n 

®mhl 

•bn b 2 

a m b n 


This is described as the multiplication of rows into columns (eg: row vectors into column 
vectors). The first matrix must have the same number of columns as there are rows in the 
second matrix or the multiplication is undefined. 


After matrix multiplication the product matrix has the same number of rows as the first 
matrix and columns as the second matrix: 


1 3 4 
6 3 2 


times 


has 2 rows and 1 column 


39 

35 


ie: first row is 1*2 + 3*3 + 4*7 = 39 and second row is 6*2 + 3*3 + 2*7 = 35 


AB = 


1 3 2 

2-13 


times 


2 3 4 

3 2 1 
5 1 3 


has 2 rows and 3 columns 


21 11 13 
16 7 16 


Notice that BA cannot be determined because the number of columns in the first 
matrix must equal the number of rows in the second matrix to perform matrix 
multiplication. 

Properties of Matrix Multiplication 

1. Not commutative AB 7 ^ BA 

2 . Associative A(BC) = (AB)C 
(A;A)B = fc(AB) = A(A:B) 

3. Distributative for matrix addition 
(A + B)C = AC + BC 

matrix multiplication is not commutative so C(A + B) = CA + CB is a separate case. 

4. The cancellation law is not always true: 

AB = 0 does not mean A = 0 or B = 0 


There is a case where matrix multiplication is commutative. This involves the scalar matrix 
where the values of the principle diagonal are all equal. Eg: 


S 


A; 0 0 
0 k 0 
0 0 A: 


In this case AS = SA = kA. If the scalar matrix is the unit matrix: AI = I A = A. 
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5.2.3 Linear Transformations 

A simple linear transformation such as: 

x\ = anyi + ai2V2 

X2 = a2iyi + a22V2 

can be expressed as: 

x = Ay 

eg: 


X\ 


Oil 

012 

* 

yi 

X2 


021 

022 


V2 


and y\ = 6112:1 + 612^2 
1)2 = 62121 + 62222 
as: y = Bz 

Using the associative law: 
x = A(Bz) = ABz = Cz 
and so: 

_ (ail6ll +ai262l) (011612 + 012622) 

(02l6li+a2262l) (021612 + 022622) 


as before. 


5.3 Indicial Notation 

Consider a simple rotation of coordinates: 
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Figure 55 


is defined as x\ , x 2 
x v is defined as x\ , x 2 
The scalar product can be written as: 
s.s = g^x^x” 

Where: 


9fj.v — 


1 0 
0 1 


and is called the metric tensor for this 2D space. 

, ' 

/ . + 9 22 x 2 x 2 

. , + 9 21 X 2 X 1 

+g i 2 xix 2 
s.s = g 11 x 1 x 1 

Now, g n = l, g V2 = 0, g 2 \ = 0, g 2 2 = 1 so: 

. 1 

'+X2X„ 

S.S = X 1 X [ 

If there is no rotation of coordinates the scalar product is: 
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S.S = X\X\ + X2X2 

s 2 = xj + X 2 

Which is Pythagoras' theorem. 


5.3.1 The Summation Convention 

Indexes that appear as both subscripts and superscripts are summed over. 

, r 

, , +9 22 x 2 x 2 

, , +921 X 2 X 1 

g^x^x" = g ux\x l 

by promoting v to a superscript it is taken out of the summation ie:. 


1/ Li U ~\~92u^2^iy 

g^x fl x 1 ' = givXiXv 


5.3.2 Matrix Multiplication in Indicial Notation 


Consider: 

Columns times rows: 

XlVl XiV2 

x 2 yi X 2 V2 

Matrix product XY = Xjij-j Where i = 1, 2 j = 1, 2 
There being no summation the indexes are both subscripts. 


Xl 

X2 


times 


V 1 2/2 


Rows times columns: 

X\ X2 

times 

2/1 




2/2 


XWl X2V2 


Matrix product XY = bijx'y 0 

Where Sij is known as Kronecker delta and has the value 0 when i / j and 1 when i = j. It 
is the indicial equivalent of the unit matrix: 

’1 o’ 

0 1 


There being summation one value of i is a subscript and the other a superscript. 

A matrix in general can be specified by any of: 

M 2 , Mij , Mj , M 13 depending on which subscript or superscript is being summed over. 


5.3.3 Vectors in Indicial Notation 

A vector can be expressed as a sum of basis vectors. 

x = aiei + a 2 e 2 + a3 e 3 

In indicial notation this is: x = a*ej 
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5.3.4 Linear Transformations in indicial notation 

Consider x = Ay where A is a coefficient matrix and x and y are coordinate matrices. 

In indicial notation this is: 

x M = A£x v 

which becomes: 

x\ = aux 1 + a 12^2 A u 1 3X3 

X2 = Cl-2\Xi + a 22 x 2 A a 23 x 3 
x 3 = &31 x \ + Cl32 x 2 + a 33 x 3 


5.3.5 The Scalar Product in indicial notation 

In indicial notation the scalar product is: 
x.y = 5ijX l y j 

5.4 Analysis of curved surfaces and transformations 

It became apparent at the start of the nineteenth century that issues such as Euclid's parallel 
postulate required the development of a new type of geometry that could deal with curved 
surfaces and real and imaginary planes. At the foundation of this approach is Gauss's 
analysis of curved surfaces which allows us to work with a variety of coordinate systems and 
displacements on any type of surface. 

Elementary geometric analysis is useful as an introduction to Special Relativity because it 
suggests the physical meaning of the coefficients that appear in coordinate transformations. 

Suppose there is a line on a surface. The length of this line can be expressed in terms of a 
coordinate system. A short length of line As in a two dimensional space may be expressed 
in terms of Pythagoras' theorem as: 

As 2 = Ax 2 + Ay 2 

Suppose there is another coordinate system on the surface with two axes: xl, x2, how can 
the length of the line be expressed in terms of these coordinates? Gauss tackled this problem 
and his analysis is quite straightforward for two coordinate axes: 

Figure 1: 
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A curved surface with a set of 
coordinates on the surface 



A plane with a set of Cartesian 
coordinates 



The plane set at a tangent to the surface so 
that the coordinates can be compared 


Figure 56 


It is possible to use elementary differential geometry to describe displacements along the 
plane in terms of displacements on the curved surfaces: 

^r = Ax 1 g + Ax 2 g 

The displacement of a short line is then assumed to be given by a formula, called a metric, 
such as Pythagoras' theorem 

AS 2 = AY ' 2 + AZ ' 2 


The values of Ay and A Z can then be substituted into this metric: 
A ^ 2 = ( £ + Aa; 2 £ ) 2 + ( Ax-! If + Ax 2 g ) 2 

Which, when expanded, gives the following: 

AS 2 = 

(££ + lflf) A ^i 

+(££+gg) A ^ 

+(££+gg) A ^ 


This can be represented using summation notation: 

AS 2 = £i=i ELi(g H + gg)A^Ax fe 


Or, using indicial notation: 
AS 2 = g ik Ax i Ax k 


Where: 

_ f 5Y SY I SZ 6Z \ 
yik V fix* 5x k Sx l 5x k ' 

If the coordinates are not merged then As is dependent on both sets of coordinates. In 
matrix notation: 
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As 2 = gAxAx 
becomes: 


Axi Ax 2 


times 


a b 
c d 


times 


Ax’i 

Ax ’2 


Where a, b, c, d stand for the values of g 
Therefore: 


Axia + Ax’2C 


Ax\b + Ax2(i 


times 


Ax'i 

Ax 2 


Which is: 

(Axia + Ax2c)Ax’i + (Axi6 + Ax2<i)Ax2 = Axi 2 a + 2AxiAx2(c + 6) + Ax2 2 d 

So: 

As 2 = Axi 2 a + 2 AxiAx 2 (c + 6 ) + Ax2 2 d 

As 2 is a bilinear form that depends on both Axi and Ax2- It can be written in matrix 
notation as: 

As 2 = Ax t AAx 

Where A is the matrix containing the values in . This is a special case of the bilinear 
form known as the quadratic form because the same matrix (Ax) appears twice; in the 
generalised bilinear form B = x T Ay (the matrices x and y are different). 

If the surface is a Euclidean plane then the values of gik are: 

8Y / 8x\8Y / 8x\ + 8Z / 8x\8Z / 8x\ 8Y / 8X28Y / 8 x\ + 8 Z / 8X28Z / 8 x\ 

8Y / 8X28Y / 8x\ + 8Z / 8x28 Z / 8x\ 8Y / 8X28Y / 8x2 + 8Z / 8X28Z / 8x2 

Which become: 




1 0 
0 1 


So the matrix A is the unit matrix I and: 

As 2 = Ax t IAx 
and: 

As 2 = Axi 2 + Ax’2 2 

Which recovers Pythagoras' theorem yet again. 

If the surface is derived from some other metric such as As 2 = — AY 2 + AZ 2 then the values 
of gik are: 

— 8Y / 5 x\ 8 Y / 8x\ + 8Z / 8x\8Z / 8x\ —8Y/8x28Y/8x\ + 8Z / 8X28Z / 8 x\ 

—8Y/8X28Y/8X1 + 8Z / 8X28Z / 8x\ —8Y 1 8X28Y j 8x2 + 8Z / 8X28Z / 8x2 

Which becomes: 
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9fi v 


-1 0 

0 1 


Which allows the original metric to be recovered ie: As 2 = — Axi 2 + Ax2 2 . 

It is interesting to compare the geometrical analysis with the transformation based on matrix 
algebra that was derived in the section on indicial notation above: 


s.s = gnxix l 
Now, 


, , +921 x 2 x i 

+gi2XlX 2 


'+922 x 2 x 2 


9 nv 


1 0 
0 1 


ie: 5n = 1, 512 = 0, 521 = 0, 522 = 1 so: 

, / 

+X2X 0 

S.S = XlX 1 

If there is no rotation of coordinates the scalar product is: 


S.S = X\X\ + X2X2 

s 2 = x 2 + x 2 

Which recovers Pythagoras' theorem. However, the reader may have noticed that Pythagoras' 
theorem had been assumed from the outset in the derivation of the scalar product (see 
above) . 

The geometrical analysis shows that if a metric is assumed and the conditions that allow 
differential geometry are present then it is possible to derive one set of coordinates from 
another. This analysis can also be performed using matrix algebra with the same assumptions. 

The example above used a simple two dimensional Pythagorean metric, some other metric 
such as the metric of a 4 D Minkowskian space: 

AS 2 = - AT 2 + AX 2 + AY 2 + AZ 2 


could be used instead of Pythagoras' theorem. 
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6.1 Applications of Special Relativity 

In this chapter we continue the study of special relativity by applying the ideas developed 
in the previous chapter to the study of waves. 

First, we shall show how to describe waves in the context of spacetime. We then see how 
waves which have no preferred reference frame (such as that of a medium supporting them) 
are constrained by special relativity to have a dispersion relation of a particular form. 
This dispersion relation turns out to be that of the relativistic matter waves of quantum 
mechanics. 

Second, we shall investigate the Doppler shift phenomenon, in which the frequency of a wave 
takes on different values in different coordinate systems. 

Third, we shall show how to add velocities in a relativistically consistent manner. This will 
also prove useful when we come to discuss particle behaviour in special relativity. 

A new mathematical idea will be presented in the context of relativistic waves, namely the 
spacetime vector or four-vector. Writing the laws of physics totally in terms of relativistic 
scalars and four- vectors ensures that they will be valid in all inertial reference frames. 


6.1.1 Waves in Spacetime 

Waves in Spacetime 

We now look at the characteristics of waves in spacetime. Recall that a wave in one space 
dimension can be represented by 

A(x,t) = Aosin(fex — uit) 

where Aq is the (constant) amplitude of the wave, k is the wavenumber, and w is the angular 
frequency, and that the quantity <f> = kx — uit is called the phase of the wave. For a wave in 
three space dimensions, the wave is represented in a similar way, 

A(x, t ) = Aq sin(k • x — uit) 

where x is now the position vector and k is the wave vector. The magnitude of the wave 
vector, |k| = k is just the wavenumber of the wave and the direction of this vector indicates 
the direction the wave is moving. The phase of the wave in this case is (f> = k • x — ait. 
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Figure 57 Figure 5.1: Sketch of wave fronts for a wave in spacetime. 


Figure 5.1: Sketch of wave fronts for a wave in spacetime. The large arrow is the associated 
wave four-vector, which has slope oj/ck. The slope of the wave fronts is the inverse, ck/ui. 

In the one-dimensional case (f> = kx — ut. A wave front has constant phase c/>, so solving this 
equation for t and multiplying by c, the speed of light in a vacuum, gives us an equation for 
the world line of a wave front: 

Ct= ckx_c± = cx_c$ ( wavefront ). 

The slope of the world line in a spacetime diagram is the coefficient of x, or c/u p , where 
u p = u/k is the phase speed. 

6.1.2 An application of relativity to waves 

Returning to the phase of a wave, we immediately see that 

(j) = k ■ x — cut = k ■ x — (ui/c) (ct) = k ■ x. 

Thus, a compact way to write a wave is A(x) = sin(fc-x).(6.8) 

Since x is known to be a four-vector and since the phase of a wave is known to be a scalar 
independent of reference frame, it follows that k is indeed a four-vector rather than just a 
set of numbers. Thus, the square of the length of the wave four- vector must also be a scalar 
independent of reference frame 
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k-k = k ■ k — w 2 /c 2 = const. 



Figure 58 Figure 5.2: Resolution of a four-vector into components in two different 
reference frames. 


Resolution of a four- vector into components in two different reference frames. 

Let us review precisely what this means. As this figure shows, we can resolve a position 
four- vector x into components in two different reference frames, e.g ( X. T) and but 

these are just different ways of writing the same vector. 

This is exactly the same as the way a three-vector has different components in a rotated 
frame. 
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Similarly, just as a three-vector has the same nragnititude in all frames, so does a 4-vector; 
i.e, 


X 2 - c 2 T 2 = X ' 2 - c 2 T ' 2 

Applying this to the wave four-vector, we infer that 

k —c u =k —c u 

where the unprimed and primed values of k and co refer to the components of the wave 
four- vector in two different reference frames. 

Up to now, this argument applies to any wave. However, waves can be divided into two 
categories, those for which a special reference frame exists, and those for which there is no 
such special frame. 

As an example of the former, sound waves look simplest in the reference frame in which 
the gas carrying the sound is stationary. The same is true of light propagating through a 
material medium with an index of refraction not equal to unity. In both cases the speed of 
the wave is the same in all directions only in the frame in which the material medium is 
stationary. 

If there is no material medium, then there is no unambiguous way of finding a special frame 
so the waves must fall into the second category. This includes all waves in a vacuum, such 
as light. 

In this case the following argument can be made. An observer moving with respect to waves 
of frequency to and wave number k sees waves of frequency co' and wave number k! . If the 
observer can tell in any way that they come from a source moving with respect to them, 
then they can use this to identify a special frame for those waves, so the waves must look 
just like ones from a stationary source of frequency co 

This forces us to conclude that for such waves 

2 2; 2 i 2 

uj = c k +/x 

where fr is a constant. All waves in a vacuum must have this form, a much more restricted 
choice than in classical physics. 

In classical physics, co and k for light are related by 


co = ck 


In relativistic physics, we've seen that for waves with no special reference frame, such as 
light, co and k are related by 


UJ 


2 


c 2 k 2 + fj, 


2 
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If f,i=0 then the relativistic equation reduces to the classical, so we can assume that, for 
light, |i does equal zero. 

This means that light does not have a mimimum frequency. 

If [x is not zero then the wave being described are dispersive. The phase speed is 


Up — 


OJ 

k 



This phase speed always exceeds c, which at first may seem like an unphysical conclusion. 
However, the group velocity of the wave is 


duj kc 2 kc 2 c 2 

Ua dk y/k 2 c 2 + n 2 UJ Up 

which is always less than c. Since wave packets and hence signals propagate at the group 
velocity, waves of this type are physically reasonable even though the phase speed exceeds 
the speed of light. 

Another interesting property of such waves is that the wave four- vector is parallel to the 
world line of a wave packet in spacetime. This is easily shown by the following argument. 

The spacelike component of a wave four- vector is k, while the timelike component is co / c . 
The slope of the four- vector on a spacetime diagram is therefore co /kc. However, the slope 
of the world line of a wave packet moving with group velocity is c/u g , which is also co /kc . 

Note that when we have k is zero we have co=p. In this case the group velocity of the wave 
is zero. For this reason we sometimes call [i the rest frequency of the wave. 


6.1.3 The Doppler Effect 

You have probably heard how the pitch of a train horn changes as it passes you. When the 
train is approaching, the pitch or frequency is higher than when it is moving away from you. 
This is called the Doppler effect. A similar, but distinct effect occurs if you are moving past 
a source of sound. If a stationary whistle is blowing, the pitch perceived from a moving car 
is higher while moving toward the source than when moving away. The first case thus has a 
moving source, while the second case has a moving observer. 

In this section we will compute the Doppler effect as it applies to light moving through a 
vacuum. The figure below shows the geometry for computing the time between wave fronts 
of light for a stationary and a moving reference frame. 

• The time between wavefronts for the stationary observer, in the stationary frame, is T. 

• The time between wavefronts for the moving observer, in the stationary frame, is T' . 

• The time between wavefronts for the moving observer, in the stationary frame, is r. 

Since the world lines of the wave fronts have a slope of unity, the sides of the shaded triangle 
both have the same value, C. If the observer is moving at velocity v , the slope of the 
observer's world line is c/v, i.e 
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c _ cT + X 
v ~ X 


Solving this for X and substituting in give cT' = cT + X gives 


T' = 


T 


1 - 


(1) 


In classical physics T' and r are the same so this formula, as it stands, leads directly to the 
classical Doppler shift for a moving observer. 

However, in relativity T' and r are different. We can use the Lorentz transformation to 
correct for this. 

The second wavefront passes the moving observer at ( vT f , cT ') in the stationary observers 
frame, but at (0,cr) in its own frame. The Lorentz transform tells us that. 

ct = 7 (^cT' — -vT'^J = cT 'y ^1 — = cT ' /y 
Substituting in equation (1) gives 


r = 


T 


= T 


1—v/c 


{l-v / c)(l+v / c) 
{1—v/c) 2 

r-p j C-\~V 

V C—V 


From this we infer the relativistic Doppler shift formula for light in a vacuum: 


v 


/ 



since frequency is inversely proportional to time. 

We could go on to determine the Doppler shift resulting from a moving source. However, by 
the principle of relativity, the laws of physics should be the same in the reference frame in 
which the observer is stationary and the source is moving. Furthermore, the speed of light 
is still c in that frame. Therefore, the problem of a stationary observer and a moving source 
is conceptually the same as the problem of a moving observer and a stationary source when 
the wave is moving at speed c. 

This is unlike the case for, say, sound waves, where the stationary observer and the stationary 
source yield different formulas for the Doppler shift. 
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6.2 External Links 

• The Relativistic Doppler Effect and its Measurement 1 

• Horn Forum 2 


1 http : //www . wbabin .net/ sf arti/ sf art i 17 .pdf 

2 http : //www .pbmissions . com/forum/ 
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7 Mathematical Transformations 


The teaching of Special Relativity on undergraduate physics courses involves a considerable 
mathematical background knowledge. Particularly important are the manipulation of vectors 
and matrices and an elementary knowledge of curvature. The background mathematics is 
given at the end of this section and can be referenced by those who are unfamiliar with 
these techniques. 


7.1 The Lorentz transformation 

The Lorentz transformation deals with the problem of observers who are moving relative to 
each other. How are the coordinates of an event recorded by one observer related to the 
coordinates of the event recorded by the other observer? The standard configuration 
used in the calculation of the Lorentz transformation is shown below: 


Transformation of Coordinates 


s 1 


Event 



The observers are moving at a relative velocity of v and each 
observer has their own set of coordinates (x^zT) and 
(x\y\z\t'). What coordinates do they assign to the event? 


Figure 59 
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There are several ways of deriving the Lorentz transformations. The usual method is to 
work from Einstein's postulates (that the laws of physics are the same between all inertial 
reference frames and the speed of light is constant) whilst adding assumptions about isotropy, 
linearity and homogeneity. The second is to work from the assumption of a four dimensional 
Minkowskian metric. 


In mathematics transformations are frequently symbolised with the "maps to" symbol: 


(■ x,y,z,t ) h- (x ’ y 




7.1.1 The linearity and homogeneity of spacetime 

Consider a clock moving freely. According to Newton's first law, objects continue in a state 
of uniform motion unless acted upon by a force; so, the velocity of the clock in any given 
direction ( dxi/dt ) must be a constant. 

If the clock is a real clock with readings given by r then the rate of change with respect 
to these readings of the elapsed time at any other point in an inertial frame of reference, 
dt/dr , will be a constant. If the clock were to tick at an uneven rate compared with other 
clocks then the universe would not be homogeneous in time — at some times the clock would 
appear to accelerate. This would also mean that Newton's first law would be broken and 
the universe would not be homogeneous in space. 

If dxi/dt and dt/dr are constant then dx^n = 1,2, 3, 4) is also constant. This means that 
the clock is not accelerating, i.e. that d?x^/dr 2 = 0. 

Linearity is demonstrated by the way that the length of things does not depend on position 
or relative position; for instance, if x' = ax 2 the distance between two points would depend 
upon the position of the observer whereas if the relationship is linear (xf = ax) separations 
are independent of position. 

The linearity and homogeneity assumptions mean that the coordinates of objects in the S' 
inertial frame are related to those in the S inertial frame by: 

x'v = (E K^Xfj) + B v 

This formula is known as a Poincare transformation. It can be expressed in indicial 
notation as: 

x v = k v u x> i + B v 

If the origins of the frames coincide then B u can be assumed to be zero and the equation: 

x u — E A i/fiXji, 

Those who are unfamiliar with the notation should note that the symbols x\ etc. mean 
x\ = x, x 2 = y, x 3 = z and X 4 = t so the equation above is shorthand for: 

x' = anx + ai- 2 y + ai 3 z + ai^t 

1 / = a 2 \x + a 22 y + a 23 z + a 2 4 t 

z! = a 3 ix + a 32 y + a 33 z + a 34 f 
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t' = C 141 X + a 42 y + a 43 z + a 44 t 

In matrix notation the set of equations can be written as: 
x ; = Ax 

The standard configuration (see diagram above) has several properties, for instance: 

The spatial origin of both observer's coordinate systems lies on the line of motion so the x 
axes can be chosen to be parallel. 

The point given by x = vt is the same as x' = 0. 

The origins of both coordinate systems can coincide so that clocks can be synchronised 
when they are next to each other. 

The coordinate planes ,y, y' and z,z', can be arranged to be orthogonal (at right angles) to 
the direction of motion. 

Isotropy means that coordinate planes that are orthogonal at y=0 and z=0 in one frame 
are orthogonal at y '=0 and z '=0 in the other frame. 

According to the relativity principle any transformations between the same two inertial 
frames of reference must be the same. This is known as the reciprocity theorem. 


7.1.2 The Lorentz transformation 

From the linearity assumption and given that at the origin y = 0 = y so there is no constant 
offset then y =Ky and y = Ky , therefore K=l. So: 



and, by the same reasoning: 

/=* 

Now, considering the x coordinate of the event, the z and y axes can be assumed to be 0 (ie: 
an arbitrary shift of the coordinates to allow the event to lie on the x axes). If this is done 
then the linearity consideration and the fact that x = vt and x =0 are the same point gives: 

( 1 ) X -y(x-rt) 

where 7 is a constant. According to the reciprocity theorem we also have: 

( 2 ) x = 7 ( x' +vt ] 

Einstein's assumption that the speed of light is a constant can now be introduced so that 
x = ct and also x =ct . So: 

Ct=1 t(c-v) 

and 

ct = 7 t'( c +^) 

So: 
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cHt '=7 2 « (c2 -* 2) 

and 

7 = 1 

' y/l-V 2 / C 2 

Therefore the Lorentz transformation equations are: 

^'=7 (t—vx/c 2 ) 
x '=l (x~vt) 



The transformation for the time coordinate can derived from the transformation for the x 
coordinate assuming x = ct and x =ct or directly from equations (1) and (2) with a similar 
substitution for x = ct. 

The coefficients of the Lorentz transformation can be represented in matrix format: 


ct' 


7 

-l-r 

0 

o' 


ct 

x' 


-17 

7 

0 

0 


x 

y' 


0 

0 

1 

0 


y 

z' 


0 

0 

0 

1 


z 


A coordinate transformation of this type, that is due to motion, is known as a boost. 


Example: convert LT matrix to linear equation 



Matrix multiplication 

cf 

= let 

— 

lix 

+ 

0 

+ 

0 

involves multiplying 
corresponding row 

x' 

= -~cl ^ 

+ 

lx 





and column 

y' 

= 0 

+ 

0 

+ 

y 

+ 

0 

elements then 

/ 





adding these 


= 0 

+ 

0 

+ 

0 

+ 

z 


The Lorentz transformation equations can be used to show that: 
c 2 dt 2 - dx' 2 - dy' 2 - dz' 2 = c 2 dt 2 - dx 2 - dy 2 - dz 2 

Although whether the assumptions of linearity, isotropy and homogeneity in the derivation 
of the Lorentz transformation actually assumed this identity from the outset is a moot point. 

Given that: c 2 dt 2 — dx' 2 — dy 2 — dz 2 also equals c 2 dt" 2 — dx' 2 — dy" 2 — dz" 2 and a continuous 
range of other transformations it is clear that: 

As 2 = c 2 At 2 — Ax 2 — Ay 2 — A z 2 

The quantity As is known as the spacetime interval and the quantity As 2 is known as 

the squared displacement. 
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A given squared displacement is constant for all observers, no matter how fast they are 
travelling, it is said to be invariant. 

The equation: 

ds 2 = c 2 dt 2 — dx 2 — dy 2 — dz 2 
is known as the metric of spacetime. 

7.1.3 Another look at the Lorentz transformation 

We have seen how length and duration both look different to a moving observer: 

• lengths are contracted by a factor 7 , when measured at the same time. 

• durations are contracted by a factor 7 , when measured in the same place. 

We can extend these results to allow for measurements taking place at different times and 
places. 

Lets consider two observers, O and O' such that O' is moving at velocity v along the x-axis 
with respect to O. We'll use primed variables for all the measurements O' makes. 

We can assume for now both observers have the same origin and x-axis because we already 
know how to allow for observers being relatively rotated and displaced. We can put these 
complications back in later. 

Now any length or duration can be written as the difference between two coordinates, for 
the two ends of the body or the start and end of the event, so it is sufficient to know how to 
change coordinates from one frame to the other. 

We know how to do this in classical physics, 

x' = x — vt 
t' = t 


we need to extend this to relativity. 

Notice that the in classical physics the relationship is linear; the graphs of these equations 
are straight lines. This makes the maths much simpler, so we will try to find a linear 
relationship between the coordinates for relativity, i.e equations of the general form 

x' = mx + nt 
t! = px + qt 

where m, n, p and q are all independent of the coordinates. 

To begin with know that 

• when t=0, xf = 7X (Lorentz contraction) 

• when x=0, tJ = jt (Time dilation) 

and that O' is travelling at velocity v. They measure their position to be at x/=0, but O 
measures it to be at vt so we must have 
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xf = 0 when x-vt = 0 

The only relationship between x and xf that satisfies these criteria is 


x = y(x — vt) 

Both observers must measure the same speed for light, 


X = ct=> x' = ct' 


or, substituting and rearranging, 


x = ct =7 t' = 7 


x — vt 


The only linear relationship between t and tf that satisfies these criteria is 


/ , vx . 

t = 7 ( j + t) 

c z 


So the primed and unprimed coordinates are related by 


x' = 7 (x — vt) 

t' = i(-5+«) 

These equations are called the Lorentz transform. 

They look simpler if we write them in terms of ct rather than t 

x' = 7 (x-^ct) 

ct' = 7 (~^x + ct) 

Written this way they look much like the equations describing a rotation in three dimensions. 
In fact, once we allow for the different Pythagorean theorem, they are exactly like the 
equations for rotation. 

• If observers are moving relative to each other, their coordinate systems are rotated in the 
(x,ct) plane. 


7.1.4 The derivation of the constancy of the speed of light from the first 
postulate alone 

The following text was taken from a "Wikiversity" lecture. 

The lorentz transformation 1 can be deduced from the first postulate alone with no additional 
assumptions. 


1 http : //en . Wikipedia. org/wiki/°/o201orentzy o 20transf ormation°/ 0 20 
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The Lorentz transformation describes the way a vector in spacetime as seen by an observer 
01 changes when it is seen by an observer 02 in a different inertial system. 

The postulate of special relativity states that the laws of physics are independent of the 
velocity of an observer. This is equivalent to the requirement that the mathematical 
formulation of the laws of physics must be invariant if the coordinates of space and time are 
altered using a lorentz transformation. 

This is the reason why the Lorentz transformation is a central concept in special relativity. 

Four-vectors are vectors that, subjected to the Lorentz transformation, behave like vectors 
from spacetime. The standard approach to find relativistic laws to the corresponding 
non-relativistic ones is to find four- vectors to replace space vectors. Surprisingly often this 
simple approach is successful. 

Here, for simplicity, the space coordinates are choosen so that the x-axis points in the 
direction in which 02 is moving with respect to 01 and the y and z coordinates are not 
considered. 


Here is a way to determine a matrix L{v) that transforms coordinates from 01 to 02 where 
v is the velocity of 02 seen from 01. 


Most generally, this matrix looks like L(v) = 


a b' 
c d , 


and transforms 


't' 
, x , 


to 


V 


via 


t' 


a b\ ( t 
c d / \ x 


The first requirement on this matrix is that 02 will be at rest (in space) within its own 
coordinate system. 


Therfore 




f r ' 

a b 


t 


c d \ tv , 


for any time t. 


This gives us ct + tvd = 0 =>• c = —vd and L(v) = 


a b ' 

— vd d , 


The backward transformation from 02 to 01 is described by the inverse matrix L{v') = 
L(u) _1 = (det(L(u)) _1 ^ ( ^j ^ (where v' is the velocity of 01 seen from 02) has to leave 

01 at rest in its own coordinates. 

Assuming v' = —v (being strict this would have to be proved) we have 

N-l ( d ~ b \( * ' 


V 

, 0 , 


(det (L(v)Y 


vd a 


j f° r any time t! resulting in a = d =>• L(v) = 


a b' 
—va a , 


The coefficients are functions of v. For all v the transformation is invertible, so 0 / det L(v) = 
a(v) 2 + va(v)b(v) =£- a(v) / 0. 


For v=0 L(v) is the identity matrix, so b(0)=0. This allows to rewrite b(v) = — va(v)e(v) 
with a yet unknown function e(v). 
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Now L(v) = a(v) 



Now for 
L(v 0 )L(vi) = 

L{v 0 )L{v i) = 


every vq.vi there exists a velocity v 2 
L(v 2 ). Using what we have proved up to 

' 1 -W(n)| o(vi) ( 1 


a(v 0 ) 


-vo 


J 


-Vl 


1 




, \ / x (l + v 0 vie(v 0 ) -vi£(vi)-v 0 s(vq)\ 
= a(v 0 )a(vi) [ l + vm e M l 


= a(y 2 ) 


VQ-Vl 

1 -v 2 e(v 2 )\ 


y-v 2 1 
= L(v 2 ) 




such 

now 


that 

gives 


Comparing the diagonal components, we see immediately that e(vo) = e(vi) for arbitrary 
values of vq,vi, and so e(v) = const. = e. 


Using this we can rewrite the equation for the a(vo)a(vi)(l + vqV\£) = a(v 2 ) and 
a(vo)a(vi)(— vo — vi) = — v 2 a(v 2 ) by comparing each of the components at indeces (1,1) and 
(2,1) respectively. Substituing a(v 2 ) yields (1) a(vo)a(vi)(— vo — vi) = — v 2 a(vo)a(vi)(l + 
vovi)e from which we have v 2 = • 

Reinserting this into the first equation we have a(vo)a(vi)(l + vovie) = 

In case v := vo = — vi this simplifies to a(v)a(— v)(l — v 2 e ) = a(0) = 1 as L(0) is the identity 
matrix. 


Let <p(x) = <p(—x) be an even function and ip(x) = —ip(—x) an odd function such that 
a{v) = e^ + ^ v \ 

This gives us e ¥ , (U+V’(U+‘p(U-V , (U- 27 ” fc = (1 — v 2 e) _1 . Taking the logarithm and simplifying 


yields ip(v) = — iln(l — v 2 e) + 7rifc from which we get a(v) = ±- 


= e V’( t ') w ith any odd 


u oy | /1 i/r u war vv xxx^xx vv ^ u lo y u j — r- — - i 

v ^ 0 

function ^(—v) = —ip(v). The negative sign is ruled out by a(0) = 1. 

Substituting a(v) in equation (1) verifies the solution but leaves us with V’(vo) + ip(v{) = 

To prove i/>(v) = 0 we flip the direction of the x-axis in the base system as well as in the 
transformed system. The choice of the direction of the axis must not lead to different 
results. In the flipped coordinates the Lorentz transformation takes the form L{y') = 


a(v') 


1 — v'e 


—v 


1 


= a(-v) 


1 VE 
V 1 , 


for v' = —v. 


On the other hand we can compute L(V) by using a coordinate transformation on L(v): 
\ 0\ ,x/l ve\/l 0 \ , ( 1 — ve^ 


L(v') = 


0 -1 


a(v) 


1 


,0 -1 


—v 


1 


from which we get a(— v) = a(v) 


= a(v) 

and hence 1 p(—v) = ip(v). Together with ip(—v) = —ip(v) this gives v ) = 0 

1 


Looking at the eigenvectors 2 of L(v) we see they are 


± 4 = 


independent of v. 


2 http : //en . Wikipedia. org/wiki/eigenvector"/ 0 20 
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This is equivalent to a transformation invariant velocity ^ e —. cn which experimentation 
shows to be equal to the speed of light. Altogether we have: 



Note that for e = 0 we would have the Galileian case with no finite invariant velocity. 

Behold that if we did the same deduction not in the spacetime plane for (t, x) but in the 
space plane (x, y) with slope 3 m=y/x instead of velocity v=x/t nothing would change except 
that, not by calculation but by experimentation alone, it would have turned out that s = — 1 
and L(m) is just a rotation in the euclidian 4 space with no real valued eigenvectors. 

It is an interesting exercise to compare this result with the assumptions of reciprocity, 
homogeneity and isotropy mentioned earlier. Do the linear equations contained in the matrix 
formulation of the Lorentz transformation above assume a particular spacetime from the 
outset? 

It is further important to mention that the determinant 5 detL(u) = 1 independent of 
velocity. This directly implicates that spacetime volumes are conserved. 

7.1.5 The geometry of space-time 

SR uses a 'flat' 4-dimensional Minkowski space, which is an example of a space-time. This 
space, however, is very similar to the standard 3 dimensional Euclidean space, and fortunately 
by that fact, very easy to work with. 

The differential of distance(ds) in Cartesian 3D space is defined as: 

ds 2 = dx\ + dx 2 + dx 2 

where ( dxi,dx 2 ,dx ^ ) are the differentials of the three spatial dimensions. In the geometry of 
special relativity, a fourth dimension, time, is added, with units of c, so that the equation 
for the differential of distance becomes: 

d-s 2 = dx 2 + dx\ + dx 2 — c 2 dt. 2 

In many situations it may be convenient to treat time as imaginary (e.g. it may simplify 
equations), in which case t in the above equation is replaced by i.t', and the metric becomes 


3 http : //en . Wikipedia . org/wiki/ slope°/ 0 20 

4 http : //en . Wikipedia . org/wiki/y o 20Euclideanyo20geometryy o 20 

5 http : //en . Wikipedia . org/wiki/determinanty o 20 
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ds 2 = dx\ + dx\ + dx\ + c 2 (dt') 2 

Note that ds in this case is the distance, and not the interval. Caution should be exercised 
in the use of 'imaginary' time, it is not part of the modern theory of relativity. Blandford 
and Thorne (2004) in their "Applications of Classical Physics" (http: //www.pma. caltech. 
edu/Courses/phl36/yr2004/ ), write the following about imaginary time: "(i) it hides the 
true physical geometry of Minkowski spacetime, (ii) it cannot be extended in any reasonable 
manner to non- orthonormal bases in flat spacetime, and (iii) it cannot be extended in any 
reasonable manner to the curvilinear coordinates that one must use in general relativity." 

If we reduce the spatial dimensions to 2, so that we can represent the physics in a 3-D space 

ds 2 = dx 2 + dx 2 — c 2 dt 2 


We see that things such as light which move at the speed of light lie along a dual-cone: 



defined by the equation 


ds 2 = 0 = dx 2 + dx 2 — c 2 dt 2 
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or 

dx 2 + dx 2 = c 2 dt 2 

Which is the equation of a circle with r = cdt. The path of something that moves at the 
speed of light is known as a null geodesic. If we extend the equation above to three spatial 
dimensions, the null geodesics are continuous concentric spheres, with radius = distance = 
cx (it time). 


t=lc 





t = 4r 


Figure 62 

ds 2 = 0 = dx\ + dx 2 + dx\ — c 2 dt 2 
dx 2 + dx 2 + dx 3 = c 2 dt 2 

This null dual-cone represents the "line of sight" of a point in space. That is, when we look 
at the stars and say "The light from that star which I am receiving is X years old.", we are 
looking down this line of sight: a null geodesic. We are looking at an event d = \J x 2 + x 2 + x 2 
meters away and d/c seconds in the past. For this reason the null dual cone is also known 
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as the 'light cone'. (The point in the lower left of the picture below represents the star, the 
origin represents the observer, and the line represents the null geodesic "line of sight".) 



Figure 63 


The cone in the -t region is the information that the point is 'receiving', while the cone in 
the +t section is the information that the point is 'sending'. 

7.1.6 Length contraction, time dilation and phase 

Consider two inertial frames in standard configuration. There is a rigid rod moving along in 
the second frame at v m/s. The length of the rod is determined by observing the positions 
of the end points of the rod simultaneously - if the rod is moving it would be nonsense to 
use any other measure of length. An observer who is moving at the same velocity as the rod 
measures its "rest length". The Lorentz transformation for coordinates along the x axis is: 

x '=i (x-vt) 

Suppose the positions, x\,X2, of the two ends of the rod are determined simultaneously (ie: 
t is constant): 

( X ' 1 -x2 )= ' y(xi - x 2 ) 
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/ / 

Or, using Lq = (x 1_a;2 for the rest length of the rod and L = ( x\ —X2) for the length of the 
rod that is measured by the observer who sees it fly past at v m/s: 

L 0 = 7 L 

Or, elaborating 7: 

L = L 0v /1 — v 2 /c 2 

In other words the length of an object moving with velocity v is contracted in the direction 
of motion by a factor yj 1 — v 2 /c 2 in the direction of motion. 

The Lorentz transformation also affects the rate at which clocks appear to change their 
readings. The Lorentz transformation for time is: 

^'=7 (t—vx/c 2 ) 

This transformation has two components: 

^ ="ft—'yvx / c 2 

and is a straight line graph (ie: t' =mt+c ). 

The gradient of the graph is 7 so: 

A t'=r At 


or: 

Therefore clocks in the moving frame will appear to go slow, if Tq is a time interval in the 
rest frame and T is a time interval in the moving frame: 

T = 7T0 
Or, expanding: 

T = To 

\J 1 — v^/c 2 

The intercept of the graph is: 

—7 vx/(? 

This means that if a clock at point x is compared with a clock that was synchronised between 
frames at the origin it will show a constant time difference of yux/c 2 seconds. This quantity 
is known as the relativistic phase difference or "phase". 
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Time dilation and phase 


t 



The graph of: 

t = 7 1 — 71T /c 2 

For clocks synchronised at 
x=0 (the origin). 



The graph of: 

t =7 t — 7l'l/c 2 

For clocks synchronised at 
X=5. 


A comparison of the x and t axes in the 
standard configuration. 


t t 




Figure 64 


The relativistic phase is as important as the length contraction and time dilation results. It 
is the amount by which clocks that are synchronised at the origin go out of synchronisation 
with distance along the direction of travel. Phase affects all clocks except those at the point 
where clocks are syncronised and the infinitessimal y and z planes that cut this point. All 
clocks everywhere else will be out of synchronisation between the frames. The effect of phase 
is shown in the illustration below: 
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The Relativity of Simultaneity and Phase 


t 




Phase describes how events that one 
observer measures to be simultaneous are 
not simultaneous for another observer. 


Figure 65 


If the inertial frames are each composed of arrays of clocks spread over space then the clocks 
will be out of synchronisation as shown in the illustration above. 

It is interesting to see how the phase term cancels out when time differences are being 
considered. It cancels out because the phase term x v ' c is applied to the clock that is at a 
constant position in the primed frame. The Lorentz transform for time is: 

^+(v/c 2 )x 

a/(1 -v 2 /c 2 ) 


A difference between two times is: 


The important thing to note is that the clock is at the same position x in its own reference 
frame. 

The result is that: 
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1 1 — 12 


U h 


^J{l-v 2 /c 2 ) 


Where the phase terms have cancelled. 

• The Time Dilation Measurement through the Ives-Stilwell Experiment 6 




Figure 66 Definition sketch for understanding the Lorentz contraction 


7.2 Hyperbolic geometry 


In the flat spacetime of Special Relativity: 
s 2 = c 2 t 2 — x 2 — y 2 — z 2 
Considering the x-axis alone: 
s 2 = c 2 t 2 — x 2 


The standard equation of a hyperbola is: 


x 2 _ y 2 
a 1 T? 


= 1 


In the case of spacetime: 



Spacetime intervals separate one place or event in spacetime from another. So, for a given 
motion from one place to another or a given fixed length in one reference frame, given 
time interval etc. the metric of spacetime describes a hyperbolic space. This hyperbolic 
space encompasses the coordinates of all the observations made of the given interval by any 
observers. 


6 http : //www . wbabin .net/ sf arti/ sfartil7.pdf 
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Hyperbolic spacetime 

t 




A plot of the hyperbola that represents all values of 
t and x that observers might measure for a given 
spacetime interval. 


Figure 67 


It is possible to conceive of rotations in hyperbolic space in a similar way to rotations in 
Euclidean space. The idea of a rotation in hyperbolic space is summarised in the illustration 
below: 
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Euclidean and hyperbolic geometry 

Euclidean triangle 


r cos a 


r sin a 



4 2 + 3 2 = S 2 


y = r sin a 
x = r cos a 




Euclidean rotation 



Hyperbolic triangle 




Hyperbola 



t = s sinh a 
x = s cosh a 


Hyperbolic rotation 



Figure 68 


A rotation in hyperbolic space is equivalent to changing from one frame of reference to 
another whilst observing the same spacetime interval. It is moving from coordinates that 
give: 

( ct ) 2 — x 2 = s 2 
to coordinates that give: 



The formula for a rotation in hyperbolic space provides an alternative form of the Lorentz 
transformation ie: 


ct 


coshi 

sinh(^> 

ct' 

X 


sinh(^> 

coshi^ 

x' 


From which: 

j. — q.' cosh <fi+ct smh 0 


140 


Hyperbolic geometry 


^,'sinh (p+ct coah ‘I’ 

The value of cf) can be determined by considering the coordinates assigned to a moving light 
that moves along the x axis from the origin at vmsec 1 flashes on for t seconds then flashes 
off. 

The coordinates assigned by an observer on the light are: i 7 , 0,0,0, the coordinates assigned 
by the stationary observer are t, x = vt, 0,0. The hyperbola representing these observations 
is illustrated below: 

The value of O from the equation 



S 


2 





tanh (p 


= vt/ct = v/c 


Figure 69 


The equation of the hyperbola is: 
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( ct ) 2 — x 2 = s 2 = ( ct 

but x=vt for the end of the flash so: 

tanh <t>=- 

Now, from hyperbolic trigonometry: 

. 1 =J ■ = cosh</> 

Y 1— tanh 2 V cosh ^~ smh ^ 

But tanh (p = - so: 

cosh,#= 7ri73 = 7 

and, from the hyperbolic trigonometric formula sinhc/> = tanh 0 cosh f/i: 
sinhc/> = ^7 

Inserting these values into the equations for the hyperbolic rotation: 

j,'cosh0+rf ,slnh ^ 

i+rt'iv/c 

x = yx ^ 

Which gives the standard transform for x: 
x = 7(V +i;f } 

In a similar way ct = x' sm ^^ +ct c ° sh * is equivalent to: 
t = 7 (7'+Wc 2 ) 

So the Lorentz transformations can also be derived from the assumption that boosts are 
equivalent to rotations in hyperbolic space with a metric -s 2 = c 2 t 2 — x 2 — y 2 — z 2 . 

The quantity 4 > is known as the rapidity of the boost. 


7.2.1 Addition of velocities 

Suppose there are three observers 1,2, and 3 who are moving at different velocities along 
the x-axis. Observers 1 and 2 are moving at a relative velocity v and observers 2 and 3 are 
moving at a relative velocity of u . The problem is to determine the velocity of observer 3 
as observed by observer 1 (u). 

It turns out that there is a very convenient relationship between rapidities that solves this 
problem: 

If v/c = tanh <p and u / c=tanha then: 
u/c = tanh(</> + a) 

In other words the rapidities can be simply added from one observer to another ie: 

a = (f> + a 

Hence: 
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tanh(cr) = tanh(^ + o:) 

So the velocities can be added by simply adding the rapidities. Using hyperbolic trigonometry: 
u/c = tanh (a + </>) = Therefore: 


Which is the relativistic velocity addition theorem. 
The relationship u/c = tanh^ + a) is shown below: 

Relative velocities 

time 



u/c. = tanh.fa' + (j>) 

From the viewpoint of observer 3 
observer 2 is separated by angle 
Q' and observer 1 is separated 
from observer 2 by angle Q . 


Figure 70 


143 


Mathematical Transformations 


Velocity transformations can be obtained without referring to the rapidity. The general case 
of the transformation of velocities in any direction is derived as follows: 

' s) 

' ( ' u2 U 

U = (ui’ U 

where u' etc. are the components of the velocity in the x, y, z directions. 

Writing out the components of velocity: 


tti 


=dx / dt 


U 2 


L=dy ! dt 


' =dz ! dt 
U3 


But from the Lorentz transformations: 

^ x ' ="t{dx—vdt) 

dy' =dy 

dz' =dz 

=7 (dt—vdx /c 2 ) 


Therefore: 

/ 'y(dx-vdt) 

/ j'/dt l(dt-vdx/c 2 ) 

/_ dy 

l —A n !/dt l(dt-vdx/c 2 ) 

^ 2 a y 

/_ dz 

l —d z '/dt 7 (dt-vdx/d 2 ) 


Dividing top and bottom of each fraction by dt: 

/ ^(dx/dt — v) 

11^- ' y(l—vdx/dt/c 2) 

/ _ dy/dt 

11^ 7 ( 1 — vdx/dt/c 

/ dz / dt 

-^3 7(1 — vdx / dt / c"^) 

Substituting u = (ui, 112,113) 

I u — v 

1 — uv / C 2 

/ = u 2 

7(1 — uv/c“^) 

/ = U 3 

^3 7(1 — uv / c“^) 

The full velocity transformations are tabulated below: 
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7.3 Mathematics of the Lorentz Transformation Equations 

Consider two observers O and O' , moving at velocity v relative to each other who synchronise 
their clocks so that t = t =0 as they pass each other. They both observe the same event as a 
flash of light. How will the coordinates recorded by the observers of the event that produced 
the light be interrelated? 

The relationship between the coordinates can be derived using linear algebra on the basis of 
the postulates of relativity and an extra homogeneity and isotropy assumption. 

The homogeneity and isotropy assumption: space is uniform and homogeneous in 

all directions. If this were not the case then when comparing lengths between coordinate 

systems the lengths would depend upon the position of the measurement. For instance, if 
/ 2 

x ~ ax the distance between two points would depend upon position. 

The linear equations relating coordinates in the primed and unprimed frames are: 


x '=aiix+a 1 2y+ai3Z+ai4t 


f 

y 


=a2ix+a,22y+a23Z+a24t 


J =a,3ix+a,32y+a33Z+a34t 


^ =CL4\X+a42y+a43Z+a44,t 

There is no relative motion in the y or z directions so, according to the 'relativity' postulate: 



Hence: 

022 = 

033 = 

So the 

x ' =aux+ai2y+ai3Z+ai4t 
t =a4ix+a42y+a,43Z+a44t 


1 and a 2 i = 023 = a 2 4 = 0 
1 and 031 = 032 = 034 = 0 
following equations remain to be solved: 
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If space is isotropic (the same in all directions) then the motion of clocks should be 
independent of the y and z axes (otherwise clocks placed symmetrically around the x-axis 
would appear to disagree). Hence: 


042 — 043 — 0 


so: 


— CL 4 I X~\~CL44 t 

Events satisfying x =0 must also satisfy x = vt. So: 

0 = anvt + ai2y + ai3Z + aut 


and 


—anvt = auy + 043,2 + a 14 f 

Given that the equations are linear then any + 0432 = 0 and: 


— a\\vt — 044^ 


and 


— 044U — 044 


Therefore the correct transformation equation for x is: 


X '=au(x-vt) 

The analysis to date gives the following equations: 


X '=au(x-vt) 



I 

t 


— d'4\X~\-CL44:t 
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Assuming that the speed of light is constant, the coordinates of a flash of light that expands 
as a sphere will satisfy the following equations in each coordinate system: 


2 , 2 , 2 2 j.2 

x + y +z = c t 


'2 , '2 i '2 2/2 

x + y +z = c t 

Substituting the coordinate transformation equations into the second equation gives: 


a n( x ~ vt ) 2 + y 2 + z 2 = c 2 (a, 4 ix + duty 

rearranging: 

(a \ 4 ~ yajyx 2 + y 2 + z 2 — 2 (vaf 1 + c 2 a4ia44)xt = (c 2 aj 4 — v 2 af 1 )t 2 
We demand that this is equivalent with 


So we get: 


2 , 2 , 2 2 j.2 

x +y +z = c t 


2 2 2 2 2 
c a 44 — v a n = c 


2 2 2 1 
ci\\ c — 1 


o o 

va 44 + C 041044 = 0 


Solving these 3 simultaneous equations gives: 


044 — 


1 


On — 


041 = 


V (1 — V 2 /c 2 ) 

1 

V(1 -v 2 /c 2 ) 

V / c 2 

\/(l -u 2 /c 2 ) 


Substituting these values into: 


r '=au(x—vt) 
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/ 

t 


— CL 41 OC CL 4.4. t 


gives: 


/ x — vt 

X yj(l-v 2 /c 2 ) 



/ t—(v/c 2 )x 

t \J (1 — V 2 / c 2 ) 


The inverse transformation is: 


x' +vt ' 

sJ{l~V 2 /c 2 ) 


y = y 


z = z 


^ +{v / c 2 )x 

\/(l -v 2 /c 2 ) 


7.3.1 Einstein's original approach 

How would two observers measure the position and timing of an event by using light rays 
if the speed of light were constant? The modern analysis of this problem, exposing the 
assumptions involved, is given above but Einstein's original reasoning (Einstein 1905,1920) 
is as follows. 

Light is transmitted along the positive x axis according to the equation x = ct where c is 
the velocity of light. This can be rewritten as: 
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x — ct = 0 


Another observer, moving relatively to the first may find different values for x and t but the 
same equation will apply: 


A simple relationship between these formulae, which apply to the same event is: 


/ ' — C t')= x (x-ct) 

[x 

Light is transmitted along the negative x axis according to the equation x = —ct where c is 
the velocity of light. This can be rewritten as: 


x + ct = 0 
x' +ct ' = ° 


And: 


[X 

Adding the equations and substituting a = and b = : 

(1) x’ = ax — bet 

(2) ct ' = act — bx 

The origin of one set of coordinates can be set so that x' =0 hence: 

be 

x = — t 
a 

If v is the velocity of one observer relative to the other then v = f and: 

( 3 ) «=£ 

At t, = 0: 

(4) x= ax 

Therefore two points separated by unit distance in the primed frame of reference ie: when 
x =1 have the following separation in the unprimed frame: 

(5) Ax = l 

Now t can be eliminated from equations (1) and (2) and combined with v = ^ and (4) to 
give in the case where x = 1 and t' =0 : 
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( 6 ) X =a{1 -^ )x 

And, if Ax = 1: 

(7) Ax' =a(1 ~^ } 

Now if the two moving systems are identical and the situation is symmetrical a measurement 
in the unprimed system of a division showing one metre on a measuring rod in the primed 
system is going to be identical to a measurement in the primed system of a division showing 
one metre on a measuring rod in the unprimed system. Thus (5) and (7) can be combined 
so that: 


1 

a 


a(l — 


) 


So: 


a 2 = 


(l"g) 


Inserting this value for a into equations (1) and (2) and solving for b gives: 


x 


x — vt 



/ t — (v/c^)x 



These are the Lorentz Transformation Equations for events on the x axis. 

Einstein, A. (1920). Relativity. The Special and General Theory. Methuen & Co Ltd 
1920. Written December, 1916. Robert W. Lawson (Authorised translation). http://www. 
bartleby.com/173/ 
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9 Licenses 


9.1 GNU GENERAL PUBLIC LICENSE 


Version 3, 29 June 2007 

Copyright © 2007 Free Software Foundation, Inc. 
<http://fsf.org/> 

Everyone is permitted to copy and distribute verba- 
tim copies of this license document, but changing 
it is not allowed. Preamble 

The GNU General Public License is a free, copyleft 
license for software and other kinds of works. 

The licenses for most software and other practi- 
cal works are designed to take away your freedom 
to share and change the works. By contrast, the 
GNU General Public License is intended to guaran- 
tee your freedom to share and change all versions 
of a program— to make sure it remains free software 
for all its users. We, the Free Software Foundation, 
use the GNU General Public License for most of our 
software; it applies also to any other work released 
this way by its authors. You can apply it to your 
programs, too. 

When we speak of free software, we are referring 
to freedom, not price. Our General Public Li- 
censes are designed to make sure that you have 
the freedom to distribute copies of free software 
(and charge for them if you wish), that you receive 
source code or can get it if you want it, that you 
can change the software or use pieces of it in new 
free programs, and that you know you can do these 
things. 

To protect your rights, we need to prevent others 
from denying you these rights or asking you to sur- 
render the rights. Therefore, you have certain re- 
sponsibilities if you distribute copies of the soft- 
ware, or if you modify it: responsibilities to respect 
the freedom of others. 

For example, if you distribute copies of such a pro- 
gram, whether gratis or for a fee, you must pass 
on to the recipients the same freedoms that you re- 
ceived. You must make sure that they, too, receive 
or can get the source code. And you must show 
them these terms so they know their rights. 

Developers that use the GNU GPL protect your 
rights with two steps: (1) assert copyright on the 
software, and (2) offer you this License giving you 
legal permission to copy, distribute and/or modify 
it. 

For the developers’ and authors’ protection, the 
GPL clearly explains that there is no warranty for 
this free software. For both users’ and authors’ 
sake, the GPL requires that modified versions be 
marked as changed, so that their problems will not 
be attributed erroneously to authors of previous 

Some devices are designed to deny users access to 
install or run modified versions of the software in- 
side them, although the manufacturer can do so. 
This is fundamentally incompatible with the aim 
of protecting users’ freedom to change the software. 
The systematic pattern of such abuse occurs in the 
area of products for individuals to use, which is 
precisely where it is most unacceptable. Therefore, 
we have designed this version of the GPL to pro- 
hibit the practice for those products. If such prob- 
lems arise substantially in other domains, we stand 
ready to extend this provision to those domains in 
future versions of the GPL, as needed to protect 
the freedom of users. 

Finally, every program is threatened constantly by 
software patents. States should not allow patents 
to restrict development and use of software on 
general-purpose computers, but in those that do, 
we wish to avoid the special danger that patents 
applied to a free program could make it effectively 
proprietary. To prevent this, the GPL assures that 
patents cannot be used to render the program non- 

The precise terms and conditions for copying, dis- 
tribution and modification follow. TERMS AND 
CONDITIONS 0. Definitions. 

“This License” refers to version 3 of the GNU Gen- 
eral Public License. 

“Copyright” also means copyright-like laws that ap- 
ply to other kinds of works, such as semiconductor 
masks. 

“The Program” refers to any copyrightable work 
licensed under this License. Each licensee is ad- 
dressed as “you”. “Licensees” and “recipients” may 
be individuals or organizations. 

To “modify” a work means to copy from or adapt 
all or part of the work in a fashion requiring copy- 
right permission, other than the making of an exact 
copy. The resulting work is called a “modified ver- 
sion” of the earlier work or a work “based on” the 
earlier work. 

A “covered work” means either the unmodified Pro- 
gram or a work based on the Program. 

To “propagate” a work means to do anything with it 
that, without permission, would make you directly 
or secondarily liable for infringement under appli- 
cable copyright law, except executing it on a com- 
puter or modifying a private copy. Propagation in- 
cludes copying, distribution (with or without mod- 
ification), making available to the public, and in 
some countries other activities as well. 

To “convey” a work means any kind of propagation 
that enables other parties to make or receive copies. 
Mere interaction with a user through a computer 


network, with no transfer of a copy, is not convey- 
ing. 

An interactive user interface displays “Appropriate 
Legal Notices” to the extent that it includes a con- 
venient and prominently visible feature that (1) dis- 
plays an appropriate copyright notice, and (2) tells 
the user that there is no warranty for the work (ex- 
cept to the extent that warranties are provided), 
that licensees may convey the work under this Li- 
cense, and how to view a copy of this License. If 
the interface presents a list of user commands or 
options, such as a menu, a prominent item in the 
list meets this criterion. 1. Source Code. 

The “source code” for a work means the preferred 
form of the work for making modifications to it. 
“Object code” means any non-source form of a 
work. 

A “Standard Interface” means an interface that ei- 
ther is an official standard defined by a recognized 
standards body, or, in the case of interfaces spec- 
ified for a particular programming language, one 
that is widely used among developers working in 
that language. 

The “System Libraries” of an executable work in- 
clude anything, other than the work as a whole, 
that (a) is included in the normal form of packag- 
ing a Major Component, but which is not part of 
that Major Component, and (b) serves only to en- 
able use of the work with that Major Component, 
or to implement a Standard Interface for which an 
implementation is available to the public in source 
code form. A “Major Component”, in this context, 
means a major essential component (kernel, window 
system, and so on) of the specific operating system 
(if any) on which the executable work runs, or a 
compiler used to produce the work, or an object 
code interpreter used to run it. 

The “Corresponding Source” for a work in object 
code form means all the source code needed to gen- 
erate, install, and (for an executable work) run 
the object code and to modify the work, including 
scripts to control those activities. However, it does 
not include the work’s System Libraries, or general- 
purpose tools or generally available free programs 
which are used unmodified in performing those ac- 
tivities but which are not part of the work. For 
example, Corresponding Source includes interface 
definition files associated with source files for the 
work, and the source code for shared libraries and 
dynamically linked subprograms that the work is 
specifically designed to require, such as by intimate 
data communication or control flow between those 
subprograms and other parts of the work. 

The Corresponding Source need not include any- 
thing that users can regenerate automatically from 
other parts of the Corresponding Source. 

The Corresponding Source for a work in source code 
form is that same work. 2. Basic Permissions. 

All rights granted under this License are granted 
for the term of copyright on the Program, and are 
irrevocable provided the stated conditions are met. 
This License explicitly affirms your unlimited per- 
mission to run the unmodified Program. The out- 
put from running a covered work is covered by this 
License only if the output, given its content, con- 
stitutes a covered work. This License acknowledges 
your rights of fair use or other equivalent, as pro- 
vided by copyright law. 

You may make, run and propagate covered works 
that you do not convey, without conditions so long 
as your license otherwise remains in force. You may 
convey covered works to others for the sole purpose 
of having them make modifications exclusively for 
you, or provide you with facilities for running those 
works, provided that you comply with the terms 
of this License in conveying all material for which 
you do not control copyright. Those thus making or 
running the covered works for you must do so exclu- 
sively on your behalf, under your direction and con- 
trol, on terms that prohibit them from making any 
copies of your copyrighted material outside their 
relationship with you. 

Conveying under any other circumstances is permit- 
ted solely under the conditions stated below. Subli- 
censing is not allowed; section 10 makes it unneces- 
sary. 3. Protecting Users’ Legal Rights From Anti- 
Circumvention Law. 

No covered work shall be deemed part of an effec- 
tive technological measure under any applicable law 
fulfilling obligations under article 11 of the WIPO 
copyright treaty adopted on 20 December 1996, or 
similar laws prohibiting or restricting circumven- 
tion of such measures. 

When you convey a covered work, you waive any 
legal power to forbid circumvention of technologi- 
cal measures to the extent such circumvention is ef- 
fected by exercising rights under this License with 
respect to the covered work, and you disclaim any 
intention to limit operation or modification of the 
work as a means of enforcing, against the work’s 
users, your or third parties’ legal rights to forbid 
circumvention of technological measures. 4. Con- 
veying Verbatim Copies. 

You may convey verbatim copies of the Program’s 
source code as you receive it, in any medium, pro- 
vided that you conspicuously and appropriately 
publish on each copy an appropriate copyright no- 
tice; keep intact all notices stating that this License 
and any non-permissive terms added in accord with 
section 7 apply to the code; keep intact all notices 
of the absence of any warranty; and give all recipi- 
ents a copy of this License along with the Program. 


You may charge any price or no price for each copy 
that you convey, and you may offer support or war- 
ranty protection for a fee. 5. Conveying Modified 
Source Versions. 

You may convey a work based on the Program, or 
the modifications to produce it from the Program, 
in the form of source code under the terms of sec- 
tion 4, provided that you also meet all of these con- 
ditions: 

* a) The work must carry prominent notices stating 
that you modified it, and giving a relevant date. * 
b) The work must carry prominent notices stating 
that it is released under this License and any con- 
ditions added under section 7. This requirement 
modifies the requirement in section 4 to “keep in- 
tact all notices”. * c) You must license the entire 
work, as a whole, under this License to anyone who 
comes into possession of a copy. This License will 
therefore apply, along with any applicable section 7 
additional terms, to the whole of the work, and all 
its parts, regardless of how they are packaged. This 
License gives no permission to license the work in 
any other way, but it does not invalidate such per- 
mission if you have separately received it. * d) If 
the work has interactive user interfaces, each must 
display Appropriate Legal Notices; however, if the 
Program has interactive interfaces that do not dis- 
play Appropriate Legal Notices, your work need not 
make them do so. 

A compilation of a covered work with other sepa- 
rate and independent works, which are not by their 
nature extensions of the covered work, and which 
are not combined with it such as to form a larger 
program, in or on a volume of a storage or distri- 
bution medium, is called an “aggregate” if the com- 
pilation and its resulting copyright are not used to 
limit the access or legal rights of the compilation’s 
users beyond what the individual works permit. In- 
clusion of a covered work in an aggregate does not 
cause this License to apply to the other parts of the 
aggregate. 6. Conveying Non-Source Forms. 

You may convey a covered work in object code form 
under the terms of sections 4 and 5, provided that 
you also convey the machine-readable Correspond- 
ing Source under the terms of this License, in one 
of these ways: 

* a) Convey the object code in, or embodied in, 
a physical product (including a physical distribu- 
tion medium), accompanied by the Corresponding 
Source fixed on a durable physical medium custom- 
arily used for software interchange. * b) Convey the 
object code in, or embodied in, a physical product 
(including a physical distribution medium), accom- 
panied by a written offer, valid for at least three 
years and valid for as long as you offer spare parts 
or customer support for that product model, to 
give anyone who possesses the object code either 
(1) a copy of the Corresponding Source for all the 
software in the product that is covered by this Li- 
cense, on a durable physical medium customarily 
used for software interchange, for a price no more 
than your reasonable cost of physically performing 
this conveying of source, or (2) access to copy the 
Corresponding Source from a network server at no 
charge. * c) Convey individual copies of the object 
code with a copy of the written offer to provide 
the Corresponding Source. This alternative is al- 
lowed only occasionally and noncommercially, and 
only if you received the object code with such an of- 
fer, in accord with subsection 6b. * d) Convey the 
object code by offering access from a designated 
place (gratis or for a charge), and offer equivalent 
access to the Corresponding Source in the same way 
through the same place at no further charge. You 
need not require recipients to copy the Correspond- 
ing Source along with the object code. If the place 
to copy the object code is a network server, the Cor- 
responding Source may be on a different server (op- 
erated by you or a third party) that supports equiv- 
alent copying facilities, provided you maintain clear 
directions next to the object code saying where to 
find the Corresponding Source. Regardless of what 
server hosts the Corresponding Source, you remain 
obligated to ensure that it is available for as long 
as needed to satisfy these requirements. * e) Con- 
vey the object code using peer-to-peer transmission, 
provided you inform other peers where the object 
code and Corresponding Source of the work are be- 
ing offered to the general public at no charge under 
subsection 6d. 

A separable portion of the object code, whose 
source code is excluded from the Corresponding 
Source as a System Library, need not be included 
in conveying the object code work. 

A “User Product” is either (1) a “consumer prod- 
uct”, which means any tangible personal property 
which is normally used for personal, family, or 
household purposes, or (2) anything designed or 
sold for incorporation into a dwelling. In deter- 
mining whether a product is a consumer product, 
doubtful cases shall be resolved in favor of cover- 
age. For a particular product received by a par- 
ticular user, “normally used” refers to a typical or 
common use of that class of product, regardless of 
the status of the particular user or of the way in 
which the particular user actually uses, or expects 
or is expected to use, the product. A product is a 
consumer product regardless of whether the prod- 
uct has substantial commercial, industrial or non- 
consumer uses, unless such uses represent the only 
significant mode of use of the product. 

“Installation Information” for a User Product 
means any methods, procedures, authorization 
keys, or other information required to install and 
execute modified versions of a covered work in that 
User Product from a modified version of its Corre- 
sponding Source. The information must suffice to 
ensure that the continued functioning of the modi- 
fied object code is in no case prevented or interfered 
with solely because modification has been made. 


If you convey an object code work under this sec- 
tion in, or with, or specifically for use in, a User 
Product, and the conveying occurs as part of a 
transaction in which the right of possession and 
use of the User Product is transferred to the re- 
cipient in perpetuity or for a fixed term (regard- 
less of how the transaction is characterized), the 
Corresponding Source conveyed under this section 
must be accompanied by the Installation Informa- 
tion. But this requirement does not apply if neither 
you nor any third party retains the ability to install 
modified object code on the User Product (for ex- 
ample, the work has been installed in ROM). 

The requirement to provide Installation Informa- 
tion does not include a requirement to continue to 
provide support service, warranty, or updates for a 
work that has been modified or installed by the re- 
cipient, or for the User Product in which it has been 
modified or installed. Access to a network may be 
denied when the modification itself materially and 
adversely affects the operation of the network or 
violates the rules and protocols for communication 
across the network. 

Corresponding Source conveyed, and Installation 
Information provided, in accord with this section 
must be in a format that is publicly documented 
(and with an implementation available to the public 
in source code form), and must require no special 
password or key for unpacking, reading or copying. 
7. Additional Terms. 

“Additional permissions” are terms that supplement 
the terms of this License by making exceptions from 
one or more of its conditions. Additional permis- 
sions that are applicable to the entire Program 
shall be treated as though they were included in 
this License, to the extent that they are valid un- 
der applicable law. If additional permissions apply 
only to part of the Program, that part may be used 
separately under those permissions, but the entire 
Program remains governed by this License without 
regard to the additional permissions. 

When you convey a copy of a covered work, you may 
at your option remove any additional permissions 
from that copy, or from any part of it. (Additional 
permissions may be written to require their own re- 
moval in certain cases when you modify the work.) 
You may place additional permissions on material, 
added by you to a covered work, for which you have 
or can give appropriate copyright permission. 

Notwithstanding any other provision of this Li- 
cense, for material you add to a covered work, you 
may (if authorized by the copyright holders of that 
material) supplement the terms of this License with 
terms: 

* a) Disclaiming warranty or limiting liability dif- 
ferently from the terms of sections 15 and 16 of this 
License; or * b) Requiring preservation of specified 
reasonable legal notices or author attributions in 
that material or in the Appropriate Legal Notices 
displayed by works containing it; or * c) Prohibit- 
ing misrepresentation of the origin of that material, 
or requiring that modified versions of such material 
be marked in reasonable ways as different from the 
original version; or * d) Limiting the use for pub- 
licity purposes of names of licensors or authors of 
the material; or * e) Declining to grant rights under 
trademark law for use of some trade names, trade- 
marks, or service marks; or * f) Requiring indem- 
nification of licensors and authors of that material 
by anyone who conveys the material (or modified 
versions of it) with contractual assumptions of lia- 
bility to the recipient, for any liability that these 
contractual assumptions directly impose on those 
licensors and authors. 

All other non-permissive additional terms are con- 
sidered “further restrictions” within the meaning of 
section 10. If the Program as you received it, or any 
part of it, contains a notice stating that it is gov- 
erned by this License along with a term that is a 
further restriction, you may remove that term. If a 
license document contains a further restriction but 
permits relicensing or conveying under this License, 
you may add to a covered work material governed 
by the terms of that license document, provided 
that the further restriction does not survive such 
relicensing or conveying. 

If you add terms to a covered work in accord with 
this section, you must place, in the relevant source 
files, a statement of the additional terms that ap- 
ply to those files, or a notice indicating where to 
find the applicable terms. 

Additional terms, permissive or non-permissive, 
may be stated in the form of a separately written 
license, or stated as exceptions; the above require- 
ments apply either way. 8. Termination. 

You may not propagate or modify a covered work 
except as expressly provided under this License. 
Any attempt otherwise to propagate or modify it is 
void, and will automatically terminate your rights 
under this License (including any patent licenses 
granted under the third paragraph of section 11). 

However, if you cease all violation of this License, 
then your license from a particular copyright holder 
is reinstated (a) provisionally, unless and until the 
copyright holder explicitly and finally terminates 
your license, and (b) permanently, if the copyright 
holder fails to notify you of the violation by some 
reasonable means prior to 60 days after the cessa- 


Moreover, your license from a particular copyright 
holder is reinstated permanently if the copyright 
holder notifies you of the violation by some reason- 
able means, this is the first time you have received 
notice of violation of this License (for any work) 



from that copyright holder, and you cure the vi- 
olation prior to 30 days after your receipt of the 
notice. 

Termination of your rights under this section does 
not terminate the licenses of parties who have re- 
ceived copies or rights from you under this License. 
If your rights have been terminated and not perma- 
nently reinstated, you do not qualify to receive new 
licenses for the same material under section 10. 9. 
Acceptance Not Required for Having Copies. 

You are not required to accept this License in or- 
der to receive or run a copy of the Program. Ancil- 
lary propagation of a covered work occurring solely 
as a consequence of using peer-to-peer transmission 
to receive a copy likewise does not require accep- 
tance. However, nothing other than this License 
grants you permission to propagate or modify any 
covered work. These actions infringe copyright if 
you do not accept this License. Therefore, by mod- 
ifying or propagating a covered work, you indicate 
your acceptance of this License to do so. 10. Auto- 
matic Licensing of Downstream Recipients. 

Each time you convey a covered work, the recipient 
automatically receives a license from the original 
licensors, to run, modify and propagate that work, 
subject to this License. You are not responsible 
for enforcing compliance by third parties with this 
License. 

An “entity transaction” is a transaction transfer- 
ring control of an organization, or substantially all 
assets of one, or subdividing an organization, or 
merging organizations. If propagation of a cov- 
ered work results from an entity transaction, each 
party to that transaction who receives a copy of the 
work also receives whatever licenses to the work the 
party’s predecessor in interest had or could give un- 
der the previous paragraph, plus a right to posses- 
sion of the Corresponding Source of the work from 
the predecessor in interest, if the predecessor has it 
or can get it with reasonable efforts. 

You may not impose any further restrictions on the 
exercise of the rights granted or affirmed under this 
License. For example, you may not impose a license 
fee, royalty, or other charge for exercise of rights 
granted under this License, and you may not ini- 
tiate litigation (including a cross-claim or counter- 
claim in a lawsuit) alleging that any patent claim 
is infringed by making, using, selling, offering for 
sale, or importing the Program or any portion of it. 
11. Patents. 

A “contributor” is a copyright holder who autho- 
rizes use under this License of the Program or a 
work on which the Program is based. The work 
thus licensed is called the contributor’s “contribu- 
tor version”. 

A contributor’s “essential patent claims” are all 
patent claims owned or controlled by the contribu- 
tor, whether already acquired or hereafter acquired, 
that would be infringed by some manner, permit- 
ted by this License, of making, using, or selling its 
contributor version, but do not include claims that 
would be infringed only as a consequence of further 
modification of the contributor version. For pur- 
poses of this definition, “control” includes the right 
to grant patent sublicenses in a manner consistent 
with the requirements of this License. 

Each contributor grants you a non-exclusive, world- 
wide, royalty-free patent license under the contrib- 
utor’s essential patent claims, to make, use, sell, of- 
fer for sale, import and otherwise run, modify and 
propagate the contents of its contributor version. 


In the following three paragraphs, a “patent li- 
cense” is any express agreement or commitment, 
however denominated, not to enforce a patent (such 
as an express permission to practice a patent or 
covenant not to sue for patent infringement). To 
“grant” such a patent license to a party means to 
make such an agreement or commitment not to en- 
force a patent against the party. 


If you convey a covered work, knowingly relying 
on a patent license, and the Corresponding Source 
of the work is not available for anyone to copy, 
free of charge and under the terms of this License, 
through a publicly available network server or other 
readily accessible means, then you must either (1) 
cause the Corresponding Source to be so available, 
or (2) arrange to deprive yourself of the benefit 
of the patent license for this particular work, or 
(3) arrange, in a manner consistent with the re- 
quirements of this License, to extend the patent 
license to downstream recipients. “Knowingly re- 
lying” means you have actual knowledge that, but 
for the patent license, your conveying the covered 
work in a country, or your recipient’s use of the cov- 
ered work in a country, would infringe one or more 
identifiable patents in that country that you have 
reason to believe are valid. 


If, pursuant to or in connection with a single trans- 
action or arrangement, you convey, or propagate 
by procuring conveyance of, a covered work, and 
grant a patent license to some of the parties re- 
ceiving the covered work authorizing them to use, 
propagate, modify or convey a specific copy of the 
covered work, then the patent license you grant is 
automatically extended to all recipients of the cov- 
ered work and works based on it. 


A patent license is “discriminatory” if it does not in- 
clude within the scope of its coverage, prohibits the 
exercise of, or is conditioned on the non-exercise 
of one or more of the rights that are specifically 
granted under this License. You may not convey a 
covered work if you are a party to an arrangement 
with a third party that is in the business of dis- 
tributing software, under which you make payment 
to the third party based on the extent of your ac- 
tivity of conveying the work, and under which the 
third party grants, to any of the parties who would 
receive the covered work from you, a discrimina- 
tory patent license (a) in connection with copies 
of the covered work conveyed by you (or copies 
made from those copies), or (b) primarily for and in 
connection with specific products or compilations 
that contain the covered work, unless you entered 
into that arrangement, or that patent license was 
granted, prior to 28 March 2007. 


Nothing in this License shall be construed as ex- 
cluding or limiting any implied license or other de- 
fenses to infringement that may otherwise be avail- 
able to you under applicable patent law. 12. No 
Surrender of Others’ Freedom. 


If conditions are imposed on you (whether by court 
order, agreement or otherwise) that contradict the 
conditions of this License, they do not excuse you 
from the conditions of this License. If you cannot 
convey a covered work so as to satisfy simultane- 
ously your obligations under this License and any 
other pertinent obligations, then as a consequence 
you may not convey it at all. For example, if you 
agree to terms that obligate you to collect a roy- 
alty for further conveying from those to whom you 
convey the Program, the only way you could satisfy 
both those terms and this License would be to re- 
frain entirely from conveying the Program. 13. Use 
with the GNU Affero General Public License. 


Notwithstanding any other provision of this Li- 
cense, you have permission to link or combine any 
covered work with a work licensed under version 
3 of the GNU Affero General Public License into 
a single combined work, and to convey the result- 
ing work. The terms of this License will continue 
to apply to the part which is the covered work, but 
the special requirements of the GNU Affero General 
Public License, section 13, concerning interaction 
through a network will apply to the combination 
as such. 14. Revised Versions of this License. 

The Free Software Foundation may publish revised 
and/or new versions of the GNU General Public Li- 
cense from time to time. Such new versions will be 
similar in spirit to the present version, but may dif- 
fer in detail to address new problems or concerns. 

Each version is given a distinguishing version num- 
ber. If the Program specifies that a certain num- 
bered version of the GNU General Public License 
“or any later version” applies to it, you have the 
option of following the terms and conditions either 
of that numbered version or of any later version 
published by the Free Software Foundation. If the 
Program does not specify a version number of the 
GNU General Public License, you may choose any 
version ever published by the Free Software Foun- 

If the Program specifies that a proxy can decide 
which future versions of the GNU General Public 
License can be used, that proxy’s public statement 
of acceptance of a version permanently authorizes 
you to choose that version for the Program. 

Later license versions may give you additional or 
different permissions. However, no additional obli- 
gations are imposed on any author or copyright 
holder as a result of your choosing to follow a later 
version. 15. Disclaimer of Warranty. 

THERE IS NO WARRANTY FOR THE PRO- 
GRAM, TO THE EXTENT PERMITTED BY AP- 
PLICABLE LAW. EXCEPT WHEN OTHERWISE 
STATED IN WRITING THE COPYRIGHT HOLD- 
ERS AND/OR OTHER PARTIES PROVIDE THE 
PROGRAM “AS IS” WITHOUT WARRANTY OF 
ANY KIND, EITHER EXPRESSED OR IMPLIED, 
INCLUDING, BUT NOT LIMITED TO, THE IM- 
PLIED WARRANTIES OF MERCHANTABILITY 
AND FITNESS FOR A PARTICULAR PURPOSE. 
THE ENTIRE RISK AS TO THE QUALITY AND 
PERFORMANCE OF THE PROGRAM IS WITH 
YOU. SHOULD THE PROGRAM PROVE DEFEC- 
TIVE, YOU ASSUME THE COST OF ALL NECES- 
SARY SERVICING, REPAIR OR CORRECTION. 
16. Limitation of Liability. 


IN NO EVENT UNLESS REQUIRED BY APPLI- 
CABLE LAW OR AGREED TO IN WRITING 
WILL ANY COPYRIGHT HOLDER, OR ANY 
OTHER PARTY WHO MODIFIES AND/OR CON- 
VEYS THE PROGRAM AS PERMITTED ABOVE, 
BE LIABLE TO YOU FOR DAMAGES, IN- 
CLUDING ANY GENERAL, SPECIAL, INCIDEN- 
TAL OR CONSEQUENTIAL DAMAGES ARISING 
OUT OF THE USE OR INABILITY TO USE 
THE PROGRAM (INCLUDING BUT NOT LIM- 
ITED TO LOSS OF DATA OR DATA BEING REN- 
DERED INACCURATE OR LOSSES SUSTAINED 
BY YOU OR THIRD PARTIES OR A FAILURE 
OF THE PROGRAM TO OPERATE WITH ANY 
OTHER PROGRAMS), EVEN IF SUCH HOLDER 
OR OTHER PARTY HAS BEEN ADVISED OF 
THE POSSIBILITY OF SUCH DAMAGES. 17. In- 
terpretation of Sections 15 and 16. 

If the disclaimer of warranty and limitation of lia- 
bility provided above cannot be given local legal ef- 


fect according to their terms, reviewing courts shall 
apply local law that most closely approximates an 
absolute waiver of all civil liability in connection 
with the Program, unless a warranty or assumption 
of liability accompanies a copy of the Program in 
return for a fee. 

END OF TERMS AND CONDITIONS How to Ap- 
ply These Terms to Your New Programs 

If you develop a new program, and you want it to 
be of the greatest possible use to the public, the 
best way to achieve this is to make it free software 
which everyone can redistribute and change under 
these terms. 

To do so, attach the following notices to the pro- 
gram. It is safest to attach them to the start of 
each source file to most effectively state the exclu- 
sion of warranty; and each file should have at least 
the “copyright” line and a pointer to where the full 
notice is found. 

<one line to give the program’s name and a brief 
idea of what it does.> Copyright (C) <year> 
<name of author> 

This program is free software: you can redistribute 
it and/or modify it under the terms of the GNU 
General Public License as published by the Free 
Software Foundation, either version 3 of the Li- 
cense, or (at your option) any later version. 

This program is distributed in the hope that 
it will be useful, but WITHOUT ANY WAR- 
RANTY; without even the implied warranty of 
MERCHANTABILITY or FITNESS FOR A PAR- 
TICULAR PURPOSE. See the GNU General Public 
License for more details. 

You should have received a copy of the GNU Gen- 
eral Public License along with this program. If not, 
see <http: / /www.gnu.org/licenses/>. 

Also add information on how to contact you by elec- 
tronic and paper mail. 

If the program does terminal interaction, make it 
output a short notice like this when it starts in an 
interactive mode: 

<program> Copyright (C) <year> <name of au- 
thor> This program comes with ABSOLUTELY 
NO WARRANTY; for details type ‘show w’. This is 
free software, and you are welcome to redistribute it 
under certain conditions; type ‘show c’ for details. 

The hypothetical commands ‘show w’ and ‘show c’ 
should show the appropriate parts of the General 
Public License. Of course, your program’s com- 
mands might be different; for a GUI interface, you 
would use an “about box”. 

You should also get your employer (if you work 
as a programmer) or school, if any, to sign a 
“copyright disclaimer” for the program, if nec- 
essary. For more information on this, and 
how to apply and follow the GNU GPL, see 
<http://www.gnu.org/licenses/>. 

The GNU General Public License does not permit 
incorporating your program into proprietary pro- 
grams. If your program is a subroutine library, you 
may consider it more useful to permit linking pro- 
prietary applications with the library. If this is 
what you want to do, use the GNU Lesser General 
Public License instead of this License. But first, 
please read <http://www.gnu.org/philosophy/why- 
not-lgpl.html>. 


9.2 GNU Free Documentation License 


Version 1.3, 3 November 2008 

Copyright © 2000, 2001, 2002, 2007, 2008 Free Soft- 
ware Foundation, Inc. <http://fsf.org/> 

Everyone is permitted to copy and distribute verba- 
tim copies of this license document, but changing 
it is not allowed. 0. PREAMBLE 

The purpose of this License is to make a manual, 
textbook, or other functional and useful document 
"free" in the sense of freedom: to assure everyone 
the effective freedom to copy and redistribute it, 
with or without modifying it, either commercially 
or noncommercially. Secondarily, this License pre- 
serves for the author and publisher a way to get 
credit for their work, while not being considered 
responsible for modifications made by others. 

This License is a kind of "copyleft", which means 
that derivative works of the document must them- 
selves be free in the same sense. It complements 
the GNU General Public License, which is a copy- 
left license designed for free software. 

We have designed this License in order to use it 
for manuals for free software, because free software 
needs free documentation: a free program should 
come with manuals providing the same freedoms 
that the software does. But this License is not lim- 
ited to software manuals; it can be used for any tex- 
tual work, regardless of subject matter or whether 
it is published as a printed book. We recommend 
this License principally for works whose purpose is 
instruction or reference. 1. APPLICABILITY AND 
DEFINITIONS 

This License applies to any manual or other work, 
in any medium, that contains a notice placed by the 
copyright holder saying it can be distributed under 
the terms of this License. Such a notice grants a 
world-wide, royalty-free license, unlimited in dura- 
tion, to use that work under the conditions stated 
herein. The "Document", below, refers to any such 
manual or work. Any member of the public is a li- 
censee, and is addressed as "you". You accept the 
license if you copy, modify or distribute the work 
in a way requiring permission under copyright law. 

A "Modified Version" of the Document means any 
work containing the Document or a portion of it, ei- 
ther copied verbatim, or with modifications and/or 
translated into another language. 

A "Secondary Section" is a named appendix or a 
front-matter section of the Document that deals ex- 
clusively with the relationship of the publishers or 


authors of the Document to the Document’s overall 
subject (or to related matters) and contains noth- 
ing that could fall directly within that overall sub- 
ject. (Thus, if the Document is in part a textbook 
of mathematics, a Secondary Section may not ex- 
plain any mathematics.) The relationship could be 
a matter of historical connection with the subject 
or with related matters, or of legal, commercial, 
philosophical, ethical or political position regard- 

The "Invariant Sections" are certain Secondary Sec- 
tions whose titles are designated, as being those of 
Invariant Sections, in the notice that says that the 
Document is released under this License. If a sec- 
tion does not fit the above definition of Secondary 
then it is not allowed to be designated as Invariant. 
The Document may contain zero Invariant Sections. 
If the Document does not identify any Invariant 
Sections then there are none. 

The "Cover Texts" are certain short passages of text 
that are listed, as Front-Cover Texts or Back-Cover 
Texts, in the notice that says that the Document is 
released under this License. A Front-Cover Text 
may be at most 5 words, and a Back-Cover Text 
may be at most 25 words. 

A "Transparent" copy of the Document means a 
machine-readable copy, represented in a format 
whose specification is available to the general pub- 
lic, that is suitable for revising the document 
straightforwardly with generic text editors or (for 
images composed of pixels) generic paint programs 
or (for drawings) some widely available drawing ed- 
itor, and that is suitable for input to text format- 
ters or for automatic translation to a variety of for- 
mats suitable for input to text formatters. A copy 
made in an otherwise Transparent file format whose 
markup, or absence of markup, has been arranged 
to thwart or discourage subsequent modification by 
readers is not Transparent. An image format is not 
Transparent if used for any substantial amount of 
text. A copy that is not "Transparent" is called 

Examples of suitable formats for Transparent 
copies include plain ASCII without markup, Tex- 
info input format, LaTeX input format, SGML or 
XML using a publicly available DTD, and standard- 
conforming simple HTML, PostScript or PDF de- 
signed for human modification. Examples of trans- 
parent image formats include PNG, XCF and JPG. 
Opaque formats include proprietary formats that 
can be read and edited only by proprietary word 
processors, SGML or XML for which the DTD 
and/or processing tools are not generally available, 
and the machine-generated HTML, PostScript or 


PDF produced by some word processors for output 
purposes only. 

The "Title Page" means, for a printed book, the 
title page itself, plus such following pages as are 
needed to hold, legibly, the material this License 
requires to appear in the title page. For works in 
formats which do not have any title page as such, 
"Title Page" means the text near the most promi- 
nent appearance of the work’s title, preceding the 
beginning of the body of the text. 

The "publisher" means any person or entity that 
distributes copies of the Document to the public. 

A section "Entitled XYZ" means a named subunit 
of the Document whose title either is precisely XYZ 
or contains XYZ in parentheses following text that 
translates XYZ in another language. (Here XYZ 
stands for a specific section name mentioned below, 
such as "Acknowledgements", "Dedications", "En- 
dorsements", or "History".) To "Preserve the Title" 
of such a section when you modify the Document 
means that it remains a section "Entitled XYZ" ac- 
cording to this definition. 

The Document may include Warranty Disclaimers 
next to the notice which states that this License 
applies to the Document. These Warranty Dis- 
claimers are considered to be included by reference 
in this License, but only as regards disclaiming war- 
ranties: any other implication that these Warranty 
Disclaimers may have is void and has no effect on 
the meaning of this License. 2. VERBATIM COPY- 
ING 

You may copy and distribute the Document in any 
medium, either commercially or noncommercially, 
provided that this License, the copyright notices, 
and the license notice saying this License applies to 
the Document are reproduced in all copies, and that 
you add no other conditions whatsoever to those 
of this License. You may not use technical mea- 
sures to obstruct or control the reading or further 
copying of the copies you make or distribute. How- 
ever, you may accept compensation in exchange for 
copies. If you distribute a large enough number of 
copies you must also follow the conditions in sec- 

You may also lend copies, under the same condi- 
tions stated above, and you may publicly display 
copies. 3. COPYING IN QUANTITY 

If you publish printed copies (or copies in media 
that commonly have printed covers) of the Doc- 
ument, numbering more than 100, and the Doc- 
ument’s license notice requires Cover Texts, you 


must enclose the copies in covers that carry, clearly 
and legibly, all these Cover Texts: Front- Cover 

Texts on the front cover, and Back-Cover Texts 
on the back cover. Both covers must also clearly 
and legibly identify you as the publisher of these 
copies. The front cover must present the full title 
with all words of the title equally prominent and 
visible. You may add other material on the covers 
in addition. Copying with changes limited to the 
covers, as long as they preserve the title of the Doc- 
ument and satisfy these conditions, can be treated 
as verbatim copying in other respects. 

If the required texts for either cover are too volu- 
minous to fit legibly, you should put the first ones 
listed (as many as fit reasonably) on the actual 
cover, and continue the rest onto adjacent pages. 

If you publish or distribute Opaque copies of the 
Document numbering more than 100, you must ei- 
ther include a machine-readable Transparent copy 
along with each Opaque copy, or state in or with 
each Opaque copy a computer-network location 
from which the general network-using public has 
access to download using public-standard network 
protocols a complete Transparent copy of the Doc- 
ument, free of added material. If you use the lat- 
ter option, you must take reasonably prudent steps, 
when you begin distribution of Opaque copies in 
quantity, to ensure that this Transparent copy will 
remain thus accessible at the stated location until 
at least one year after the last time you distribute 
an Opaque copy (directly or through your agents or 
retailers) of that edition to the public. 

It is requested, but not required, that you con- 
tact the authors of the Document well before redis- 
tributing any large number of copies, to give them 
a chance to provide you with an updated version of 
the Document. 4. MODIFICATIONS 

You may copy and distribute a Modified Version of 
the Document under the conditions of sections 2 
and 3 above, provided that you release the Modi- 
fied Version under precisely this License, with the 
Modified Version filling the role of the Document, 
thus licensing distribution and modification of the 
Modified Version to whoever possesses a copy of it. 
In addition, you must do these things in the Modi- 
fied Version: 

* A. Use in the Title Page (and on the covers, if 
any) a title distinct from that of the Document, 
and from those of previous versions (which should, 
if there were any, be listed in the History section 
of the Document). You may use the same title as 
a previous version if the original publisher of that 
version gives permission. * B. List on the Title 



Page, as authors, one or more persons or entities 
responsible for authorship of the modifications in 
the Modified Version, together with at least five of 
the principal authors of the Document (all of its 
principal authors, if it has fewer than five), unless 
they release you from this requirement. * C. State 
on the Title page the name of the publisher of the 
Modified Version, as the publisher. * D. Preserve 
all the copyright notices of the Document. * E. Add 
an appropriate copyright notice for your modifica- 
tions adjacent to the other copyright notices. * F. 
Include, immediately after the copyright notices, a 
license notice giving the public permission to use 
the Modified Version under the terms of this Li- 
cense, in the form shown in the Addendum below. * 
G. Preserve in that license notice the full lists of In- 
variant Sections and required Cover Texts given in 
the Document’s license notice. * H. Include an unal- 
tered copy of this License. * I. Preserve the section 
Entitled "History", Preserve its Title, and add to it 
an item stating at least the title, year, new authors, 
and publisher of the Modified Version as given on 
the Title Page. If there is no section Entitled "His- 
tory" in the Document, create one stating the title, 
year, authors, and publisher of the Document as 
given on its Title Page, then add an item describ- 
ing the Modified Version as stated in the previous 
sentence. * J. Preserve the network location, if any, 
given in the Document for public access to a Trans- 
parent copy of the Document, and likewise the net- 
work locations given in the Document for previous 
versions it was based on. These may be placed in 
the "History" section. You may omit a network lo- 
cation for a work that was published at least four 
years before the Document itself, or if the original 
publisher of the version it refers to gives permission. 
* K. For any section Entitled "Acknowledgements" 
or "Dedications", Preserve the Title of the section, 
and preserve in the section all the substance and 
tone of each of the contributor acknowledgements 
and/or dedications given therein. * L. Preserve all 
the Invariant Sections of the Document, unaltered 
in their text and in their titles. Section numbers or 
the equivalent are not considered part of the section 
titles. * M. Delete any section Entitled "Endorse- 
ments". Such a section may not be included in the 
Modified Version. * N. Do not retitle any existing 
section to be Entitled "Endorsements" or to conflict 
in title with any Invariant Section. * O. Preserve 
any Warranty Disclaimers. 

If the Modified Version includes new front-matter 
sections or appendices that qualify as Secondary 
Sections and contain no material copied from the 
Document, you may at your option designate some 
or all of these sections as invariant. To do this, add 
their titles to the list of Invariant Sections in the 
Modified Version’s license notice. These titles must 
be distinct from any other section titles. 

You may add a section Entitled "Endorsements", 
provided it contains nothing but endorsements of 
your Modified Version by various parties — for ex- 
ample, statements of peer review or that the text 
has been approved by an organization as the au- 
thoritative definition of a standard. 

You may add a passage of up to five words as a 
Front-Cover Text, and a passage of up to 25 words 
as a Back-Cover Text, to the end of the list of Cover 
Texts in the Modified Version. Only one passage of 
Front-Cover Text and one of Back-Cover Text may 
be added by (or through arrangements made by) 
any one entity. If the Document already includes 
a cover text for the same cover, previously added 
by you or by arrangement made by the same entity 
you are acting on behalf of, you may not add an- 


other; but you may replace the old one, on explicit 
permission from the previous publisher that added 
the old one. 

The author(s) and publisher(s) of the Document do 
not by this License give permission to use their 
names for publicity for or to assert or imply en- 
dorsement of any Modified Version. 5. COMBIN- 
ING DOCUMENTS 

You may combine the Document with other docu- 
ments released under this License, under the terms 
defined in section 4 above for modified versions, 
provided that you include in the combination all 
of the Invariant Sections of all of the original doc- 
uments, unmodified, and list them all as Invariant 
Sections of your combined work in its license no- 
tice, and that you preserve all their Warranty Dis- 
claimers. 

The combined work need only contain one copy of 
this License, and multiple identical Invariant Sec- 
tions may be replaced with a single copy. If there 
are multiple Invariant Sections with the same name 
but different contents, make the title of each such 
section unique by adding at the end of it, in paren- 
theses, the name of the original author or publisher 
of that section if known, or else a unique number. 
Make the same adjustment to the section titles in 
the list of Invariant Sections in the license notice 
of the combined work. 

In the combination, you must combine any sections 
Entitled "History" in the various original docu- 
ments, forming one section Entitled "History"; like- 
wise combine any sections Entitled "Acknowledge- 
ments", and any sections Entitled "Dedications". 
You must delete all sections Entitled "Endorse- 
ments". 6. COLLECTIONS OF DOCUMENTS 

You may make a collection consisting of the Docu- 
ment and other documents released under this Li- 
cense, and replace the individual copies of this Li- 
cense in the various documents with a single copy 
that is included in the collection, provided that you 
follow the rules of this License for verbatim copying 
of each of the documents in all other respects. 

You may extract a single document from such a col- 
lection, and distribute it individually under this Li- 
cense, provided you insert a copy of this License 
into the extracted document, and follow this Li- 
cense in all other respects regarding verbatim copy- 
ing of that document. 7. AGGREGATION WITH 
INDEPENDENT WORKS 

A compilation of the Document or its derivatives 
with other separate and independent documents or 
works, in or on a volume of a storage or distribution 
medium, is called an "aggregate" if the copyright re- 
sulting from the compilation is not used to limit the 
legal rights of the compilation’s users beyond what 
the individual works permit. When the Document 
is included in an aggregate, this License does not 
apply to the other works in the aggregate which are 
not themselves derivative works of the Document. 

If the Cover Text requirement of section 3 is appli- 
cable to these copies of the Document, then if the 
Document is less than one half of the entire aggre- 
gate, the Document’s Cover Texts may be placed 
on covers that bracket the Document within the 
aggregate, or the electronic equivalent of covers 
if the Document is in electronic form. Otherwise 
they must appear on printed covers that bracket 
the whole aggregate. 8. TRANSLATION 


Translation is considered a kind of modification, so 
you may distribute translations of the Document 
under the terms of section 4. Replacing Invariant 
Sections with translations requires special permis- 
sion from their copyright holders, but you may in- 
clude translations of some or all Invariant Sections 
in addition to the original versions of these Invari- 
ant Sections. You may include a translation of this 
License, and all the license notices in the Document, 
and any Warranty Disclaimers, provided that you 
also include the original English version of this Li- 
cense and the original versions of those notices and 
disclaimers. In case of a disagreement between the 
translation and the original version of this License 
or a notice or disclaimer, the original version will 
prevail. 

If a section in the Document is Entitled "Acknowl- 
edgements", "Dedications", or "History", the re- 
quirement (section 4) to Preserve its Title (section 
1) will typically require changing the actual title. 
9. TERMINATION 

You may not copy, modify, sublicense, or distribute 
the Document except as expressly provided under 
this License. Any attempt otherwise to copy, mod- 
ify, sublicense, or distribute it is void, and will 
automatically terminate your rights under this Li- 
cense. 

However, if you cease all violation of this License, 
then your license from a particular copyright holder 
is reinstated (a) provisionally, unless and until the 
copyright holder explicitly and finally terminates 
your license, and (b) permanently, if the copyright 
holder fails to notify you of the violation by some 
reasonable means prior to 60 days after the cessa- 


Moreover, your license from a particular copyright 
holder is reinstated permanently if the copyright 
holder notifies you of the violation by some reason- 
able means, this is the first time you have received 
notice of violation of this License (for any work) 
from that copyright holder, and you cure the vi- 
olation prior to 30 days after your receipt of the 
notice. 

Termination of your rights under this section does 
not terminate the licenses of parties who have re- 
ceived copies or rights from you under this License. 
If your rights have been terminated and not perma- 
nently reinstated, receipt of a copy of some or all 
of the same material does not give you any rights 
to use it. 10. FUTURE REVISIONS OF THIS LI- 
CENSE 

The Free Software Foundation may publish new, re- 
vised versions of the GNU Free Documentation Li- 
cense from time to time. Such new versions will be 
similar in spirit to the present version, but may dif- 
fer in detail to address new problems or concerns. 
See http://www.gnu.org/copyleft/. 

Each version of the License is given a distinguish- 
ing version number. If the Document specifies that 
a particular numbered version of this License "or 
any later version" applies to it, you have the op- 
tion of following the terms and conditions either of 
that specified version or of any later version that 
has been published (not as a draft) by the Free Soft- 
ware Foundation. If the Document does not specify 
a version number of this License, you may choose 
any version ever published (not as a draft) by the 
Free Software Foundation. If the Document speci- 
fies that a proxy can decide which future versions of 


this License can be used, that proxy’s public state- 
ment of acceptance of a version permanently autho- 
rizes you to choose that version for the Document. 
11. RELICENSING 

"Massive Multiauthor Collaboration Site" (or 
"MMC Site") means any World Wide Web server 
that publishes copyrightable works and also pro- 
vides prominent facilities for anybody to edit those 
works. A public wiki that anybody can edit is 
an example of such a server. A "Massive Multiau- 
thor Collaboration" (or "MMC") contained in the 
site means any set of copyrightable works thus pub- 
lished on the MMC site. 

"CC-BY-SA" means the Creative Commons 
Attribution-Share Alike 3.0 license published by 
Creative Commons Corporation, a not-for-profit 
corporation with a principal place of business in 
San Francisco, California, as well as future copyleft 
versions of that license published by that same 
organization. 

"Incorporate" means to publish or republish a Doc- 
ument, in whole or in part, as part of another Doc- 
ument. 

An MMC is "eligible for relicensing" if it is licensed 
under this License, and if all works that were first 
published under this License somewhere other than 
this MMC, and subsequently incorporated in whole 
or in part into the MMC, (1) had no cover texts or 
invariant sections, and (2) were thus incorporated 
prior to November 1, 2008. 

The operator of an MMC Site may republish an 
MMC contained in the site under CC-BY-SA on the 
same site at any time before August 1, 2009, pro- 
vided the MMC is eligible for relicensing. ADDEN- 
DUM: How to use this License for your documents 

To use this License in a document you have written, 
include a copy of the License in the document and 
put the following copyright and license notices just 
after the title page: 

Copyright (C) YEAR YOUR NAME. Permission is 
granted to copy, distribute and/or modify this doc- 
ument under the terms of the GNU Free Documen- 
tation License, Version 1.3 or any later version pub- 
lished by the Free Software Foundation; with no 
Invariant Sections, no Front-Cover Texts, and no 
Back-Cover Texts. A copy of the license is included 
in the section entitled "GNU Free Documentation 
License" . 

If you have Invariant Sections, Front-Cover Texts 
and Back-Cover Texts, replace the "with . . . 
Texts." line with this: 

with the Invariant Sections being LIST THEIR TI- 
TLES, with the Front-Cover Texts being LIST, and 
with the Back-Cover Texts being LIST. 

If you have Invariant Sections without Cover Texts, 
or some other combination of the three, merge 
those two alternatives to suit the situation. 

If your document contains nontrivial examples of 
program code, we recommend releasing these exam- 
ples in parallel under your choice of free software 
license, such as the GNU General Public License, 
to permit their use in free software. 


9.3 GNU Lesser General Public License 


GNU LESSER GENERAL PUBLIC LICENSE 
Version 3, 29 June 2007 

Copyright © 2007 Free Software Foundation, Inc. 
<http://fsf.org/> 

Everyone is permitted to copy and distribute verba- 
tim copies of this license document, but changing 
it is not allowed. 

This version of the GNU Lesser General Public Li- 
cense incorporates the terms and conditions of ver- 
sion 3 of the GNU General Public License, supple- 
mented by the additional permissions listed below. 
0. Additional Definitions. 

As used herein, “this License” refers to version 3 
of the GNU Lesser General Public License, and the 
“GNU GPL” refers to version 3 of the GNU General 
Public License. 

“The Library” refers to a covered work governed by 
this License, other than an Application or a Com- 
bined Work as defined below. 

An “Application” is any work that makes use of an 
interface provided by the Library, but which is not 
otherwise based on the Library. Defining a subclass 
of a class defined by the Library is deemed a mode 
of using an interface provided by the Library. 

A “Combined Work” is a work produced by com- 
bining or linking an Application with the Library. 
The particular version of the Library with which 
the Combined Work was made is also called the 
“Linked Version”. 

The “Minimal Corresponding Source” for a Com- 
bined Work means the Corresponding Source for 
the Combined Work, excluding any source code for 
portions of the Combined Work that, considered in 
isolation, are based on the Application, and not on 
the Linked Version. 


The “Corresponding Application Code” for a Com- 
bined Work means the object code and/or source 
code for the Application, including any data and 
utility programs needed for reproducing the Com- 
bined Work from the Application, but excluding the 
System Libraries of the Combined Work. 1. Excep- 
tion to Section 3 of the GNU GPL. 

You may convey a covered work under sections 3 
and 4 of this License without being bound by sec- 
tion 3 of the GNU GPL. 2. Conveying Modified 
Versions. 

If you modify a copy of the Library, and, in your 
modifications, a facility refers to a function or data 
to be supplied by an Application that uses the fa- 
cility (other than as an argument passed when the 
facility is invoked), then you may convey a copy of 
the modified version: 

* a) under this License, provided that you make a 
good faith effort to ensure that, in the event an Ap- 
plication does not supply the function or data, the 
facility still operates, and performs whatever part 
of its purpose remains meaningful, or * b) under 
the GNU GPL, with none of the additional permis- 
sions of this License applicable to that copy. 

3. Object Code Incorporating Material from Li- 
brary Header Files. 

The object code form of an Application may incor- 
porate material from a header file that is part of 
the Library. You may convey such object code un- 
der terms of your choice, provided that, if the in- 
corporated material is not limited to numerical pa- 
rameters, data structure layouts and accessors, or 
small macros, inline functions and templates (ten 
or fewer lines in length), you do both of the follow- 
ing: 

* a) Give prominent notice with each copy of the 
object code that the Library is used in it and that 
the Library and its use are covered by this License. 

* b) Accompany the object code with a copy of the 
GNU GPL and this license document. 


4. Combined Works. 


You may convey a Combined Work under terms of 
your choice that, taken together, effectively do not 
restrict modification of the portions of the Library 
contained in the Combined Work and reverse en- 
gineering for debugging such modifications, if you 
also do each of the following: 

* a) Give prominent notice with each copy of the 
Combined Work that the Library is used in it and 
that the Library and its use are covered by this Li- 
cense. * b) Accompany the Combined Work with a 
copy of the GNU GPL and this license document. * 
c) For a Combined Work that displays copyright no- 
tices during execution, include the copyright notice 
for the Library among these notices, as well as a ref- 
erence directing the user to the copies of the GNU 
GPL and this license document. * d) Do one of the 
following: o 0) Convey the Minimal Corresponding 
Source under the terms of this License, and the Cor- 
responding Application Code in a form suitable for, 
and under terms that permit, the user to recombine 
or relink the Application with a modified version 
of the Linked Version to produce a modified Com- 
bined Work, in the manner specified by section 6 of 
the GNU GPL for conveying Corresponding Source, 
o 1) Use a suitable shared library mechanism for 
linking with the Library. A suitable mechanism 
is one that (a) uses at run time a copy of the Li- 
brary already present on the user’s computer sys- 
tem, and (b) will operate properly with a modified 
version of the Library that is interface-compatible 
with the Linked Version. * e) Provide Installation 
Information, but only if you would otherwise be re- 
quired to provide such information under section 6 
of the GNU GPL, and only to the extent that such 
information is necessary to install and execute a 
modified version of the Combined Work produced 
by recombining or relinking the Application with 
a modified version of the Linked Version. (If you 
use option 4d0, the Installation Information must 
accompany the Minimal Corresponding Source and 
Corresponding Application Code. If you use option 
4dl, you must provide the Installation Information 
in the manner specified by section 6 of the GNU 
GPL for conveying Corresponding Source.) 


5. Combined Libraries. 

You may place library facilities that are a work 
based on the Library side by side in a single library 
together with other library facilities that are not 
Applications and are not covered by this License, 
and convey such a combined library under terms of 
your choice, if you do both of the following: 

* a) Accompany the combined library with a copy 
of the same work based on the Library, uncombined 
with any other library facilities, conveyed under 
the terms of this License. * b) Give prominent no- 
tice with the combined library that part of it is a 
work based on the Library, and explaining where 
to find the accompanying uncombined form of the 
same work. 

6. Revised Versions of the GNU Lesser General 
Public License. 

The Free Software Foundation may publish revised 
and/or new versions of the GNU Lesser General 
Public License from time to time. Such new ver- 
sions will be similar in spirit to the present version, 
but may differ in detail to address new problems or 
concerns. 

Each version is given a distinguishing version num- 
ber. If the Library as you received it specifies that 
a certain numbered version of the GNU Lesser Gen- 
eral Public License “or any later version” applies to 
it, you have the option of following the terms and 
conditions either of that published version or of any 
later version published by the Free Software Foun- 
dation. If the Library as you received it does not 
specify a version number of the GNU Lesser Gen- 
eral Public License, you may choose any version of 
the GNU Lesser General Public License ever pub- 
lished by the Free Software Foundation. 

If the Library as you received it specifies that a 
proxy can decide whether future versions of the 
GNU Lesser General Public License shall apply, 
that proxy’s public statement of acceptance of 
any version is permanent authorization for you to 
choose that version for the Library. 



